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CLASSIFICATION OF PO! 




PEPTIDES BY LIGAND GEOMETRY AND 



.TED METHODS 



BACKGROUND OF THE INVENTION 



The present invention relates generally to 



interactions between ligands and polypeptides and more 
specifically to determining structure-related properties of 
a ligand when bound to different polypeptides. 



chemistry and biology due to the correlation between the 
structure of a molecule and its function. Although a full 
understanding of this correlation is not yet established, 
one can gain insight into the function of a molecule from 
its deduced structure. Thus, the structure can provide a 
strong basis for formulating experiments to determine 
function. Conversely, the eventual disclosure of a 
structure for a well studied molecule can have a significant 
effect in converging apparently disparate observations of 
function into a consistent description of the molecule's 
activity. 



increasingly dependent upon structure information include, 
for example, the production of therapeutic drugs. 
Therapeutic drugs can be designed by synthesizing a molecule 
that mimics a ligand known to interact with a target 
receptor. Alternatively, a therapeutic drug can be designed 
by computer assisted methods in which a molecule is designed 
to dock to a binding site on a receptor of known structure. 



Structure determination plays a central role in 



Practical applications which are becoming 



By structure -based methods such as these, lead compounds can 
be identified for further development. 

Using a similar structure based approach a 
receptor can be engineered to yield improved or novel 
functions. For example, changes can be made at a ligand 
binding site in a polypeptide receptor based on the known 
structure of the receptor. Given that a polypeptide 
receptor can contain hundreds or even thousands of amino 
acid residues, of -which only a few may contact a ligand, 
structural information is useful in identifying where 
changes should be made in the polypeptide to alter ligand 
binding. Polypeptide receptors engineered as such can be 
used for a variety of practical applications including, for 
example, industrial catalysis, therapeutics, and 
bioremediation . 

Although methods for structure . determination are 
evolving, it is currently difficult, costly and time 
consuming to determine the structure of a polypeptide or 
ligand. It can often be even more difficult to produce a 
polypeptide -ligand complex in a condition allowing 
determination of a structure for the bound complex. 
Resorting to determining a structure for the receptor 
individually can have limited value, particularly if the 
location of ligand binding is difficult to identify due to 
the large size of most polypeptide receptors. Similarly, 
determination of a structure of an unbound ligand can have 
limited usefulness because an unbound ligand has multiple 
conformations and the most stable conformation of an unbound 
ligand is often different from its conformation when bound 
to a receptor. 
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Theoretical modeling of ligand-polypeptide 
interactions is one alternative that has been attempted in 
cases where the structure of the polypeptide-ligand complex 
is not available. In this approach a ligand is fitted to a 
5 structure of a polypeptide. The polypeptide structure used 
can be determined empirically or theoretically. Theoretical 
determination of a hypothetical molecular structure for a 
polypeptide by ab initio methods is a relatively undeveloped 
method. Another theoretical approach, referred to as 
10 homology modeling, has been used to infer structure based on 
comparison with molecules of known structure. 

The successful application of homology modeling to 
determining polypeptide-ligand interactions relies upon 
choosing a correct polypeptide template for comparison. In 
most cases criteria for comparison are unavailable or 
unreliable. For example, it is common to produce a 
hypothetical structure of a target polypeptide based on the 
empirically determined structure of a template polypeptide 
having similar sequence. However, similarities in sequence 
do not always yield similar structures and conversely, 
similar structures have been observed for two polypeptides 
having significantly diverged sequences. 

Thus, there exists a need for efficient methods to 
identify properties of a ligand that confer binding 
25 specificity for polypeptide receptors. A need also exists 
for methods to classify polypeptides and ligands according 
to structural characteristics. The present invention 
satisfies this need and provides related advantages as well. 
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SUMMARY OF THE INVENTION 



The invention provides a method for identifying a 
pharmacocluster . The method includes the steps of (a) 
determining bound conformations of a ligand bound to 
different polypeptides, and (b) clustering two or more bound 
conformations of the ligand having substantially the same 
bound conformation, thereby identifying a pharmacocluster. 
The invention also provides a method for identifying a 
member of a pharmacocluster . The invention also provides a 
method for identifying a polypeptide pharmacof ami ly. The 
method includes the steps of (a) determining bound 
conformations of a ligand bound to different polypeptides of 
a polypeptide family, and (b) identifying two or more bound 
conformations of the ligand having substantially different 
bound conformations, thereby identifying at least two 
polypeptide pharmacof amilies exhibiting binding specificity 
for the two or more substantially different bound 
conformations of the ligand. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows pharmacoclusters identified from a 
database of 156 bound structures of nicotinamide adenine 
dinucleotide or nicotinamide adenine dinucleotide phosphate. 
Structures were generated using the overlay function in 
INSIGHT 9 8 (Molecular Simulations Inc., San Diego, CA) . 

Figure 2 shows the nomenclature used herein for 
atom names in the NAD(P) molecule. 
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Figure 3 shows conformer models with interacting 
atoms from bound polypeptide and ordered waters overlayed. 
Models in parts A through H were derived from 
pharmacoclusters 1-8, respectively as described in the 
5 Examples. Overlayed atoms and waters are identified as 
either hydrogen bond donors (donors) , hydrogen bond 
acceptors (acceptors) , sulfurs (sulfurs) , waters (waters) , 
or atoms that can be hydrogen bond acceptors or hydrogen 
bond donors (acceptors/donors) according to the legends 
10 under each conformer model. 

Figure 4 shows a portion of a 2D I^H^H] NOESY 
spectrum recorded with a 0.2 ml sample of 1 mM NADP and 200 
\M of enzyme 1-deoxy D-xylulose 5 -phosphate reductoisomerase 
(DOXP) . Atoms are identified according to Figure 2. Spectra 
15 are reported as parts per million (ppm) . Since the ligand 
is in fast exchange and is in excess over polypeptide, cross 
peaks represent transferred NOEs . 

Figure 5 shows high affinity binding of compound 
TTE0001 . 001 . A07 to polypeptide enzymes of pharmacof amily 1 

20 (panel A) and pharmacof amily 8 (panel B) . Double reciprocal 
plots of reaction rate versus concentration of NADH (panel 
A) or NADPH (panel B) are shown for each enzyme in the 
presence of various concentrations of compound 
TTE0001.001.A07. Concentrations of compound TTE0001 . 001 .AO 7 

25 shown to the right of the plot A correspond 7 . 1 \xM (open 

triangles), 3.6 jaM (closed triangles), 1.8 |aM (open circles) 
and no added compound (closed circles) . Concentrations of 
compound TTE0001 . 001 .AO 7 shown to the right of the plot B 
correspond 56.2 |iM (open triangles), 37.5 jiM (closed 



triangles), 18.7 (iM (open circles) and no added compound 
(closed circles) . Inhibitory dissociation constants (K is ) 
determined from the data are shown in the upper left corner 
of the respective plot. 

Figure 6 shows high affinity binding of compound 
TTE0001 . 002 . D02 to a polypeptide enzyme of pharmacof amily 1. 
A double reciprocal plot of reaction rate versus 
concentration of NADH is shown for the enzyme in the 
presence of various concentrations of compound 
TTE0001 ..002 . D02 . .Concentrations of compound TTE0001 . 002 . DO 2 
shown to the right of the plot A correspond 2 0.6 jiM (open 
triangles), 13.7 jiM (closed triangles), 6.9 |iM (open 
circles) and no added compound (closed circles) . An 
inhibitory dissociation constant (K is ) determined from the 
data is shown in the upper left corner of the plot.. 

Figure 7 shows a pharmacophore model derived from 
the coordinates presented in Table 3 for pharmacof amily 1. 
Figure 7A shows a feature of the pharmacophore model 
including a volume defining the shape of conformer model 1 
which is indicated by grey spheres and superimposed on the 
conformer model having coordinates listed in Table 3C. 
Figure 7B shows three features of the pharmacophore model 
including a hydrophobic region of the nicotinamide ring, a 
hydrogen bond acceptor positioned at the averaged 
coordinates for the location of 17 hydrogen bond acceptors 
in the polypeptides of pharmacof amily 1, and a hydrogen bond 
donor positioned where a hydrogen bond donor of a ligand 
would be expected to have favorable interactions with 
hydrogen bond acceptors observed in 11 of the 17 
polypeptides in pharmacof amily 1. Figure 7C shows a 
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combination of features of figures 7A and 7B present in a 
pharmacophore model and superimposed on the conformer model. 



DETAILED DESCRIPTION OF THE INVENTION 

The invention provides pharmacoclusters and 
5 methods for identifying a pharmacocluster from bound 

conformations of a ligand bound to different polypeptides. 
The methods are applicable for identifying a conformation- 
dependent property of a ligand based on bound conformations 
of the ligand in a pharmacocluster. The- methods are also 
10 applicable for classifying polypeptides, from a family of 
polypeptides that bind the same ligand, into 

pharmacof amilies based on bound conformations of the ligand. 
Accordingly, methods are provided for grouping polypeptides 
into pharmacof amilies by determining bound conformations of 

15 a ligand or a conformation-dependent property of a ligand 
independent of a determination of the structure of the 
polypeptide. An advantage of classifying polypeptides 
according to bound conformations of a ligand is that a 
pharmacof amily is likely to contain polypeptides having 

20 greater binding specificity for a particular molecule than 
other polypeptides in the same family. Thus, the methods 
allow identification of a pharmacof amily that can 
specifically interact with a particular therapeutic agent or 
drug . 

25 Additionally, the methods of the invention can be 

used to determine a conformer model or pharmacophore model 
based on a bound conformation or conformation-dependent 
property of a ligand bound to polypeptides in a 
pharmacof amily . The invention is therefore advantageous in 
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providing a model for the design and identification of 
therapeutic compounds having specificity for a 
pharmacof amily of polypeptides. 

Another advantage of the invention is that the 
methods provide a correlation between ligand conformation, a 
parameter that is relatively easy to measure, and 
polypeptide structure, a parameter of tremendous value but 
often difficult to measure. Therefore, the methods of the 
invention can be used to determine structural • 
characteristics of a polypeptide based on a conformation^ 
dependent property of a bound ligand. 

As used herein, the term "pharmacocluster" refers 
to a collection of substantially the same bound 
conformations of a ligand, or portion thereof, bound to two 
or more polypeptides. A member conformation of a 
pharmacocluster can have (1) a conformation that is more 
similar to an average conformation of the members in its 
pharmacocluster than to any other pharmacocluster and (2) a 
conformation that is more similar to an average conformation 
of the members in its own pharmacocluster than the most 
similar average structures from different pharmacoclusters 
are to each other, wherein the pharmacoclusters consist of 
conformations of the same ligand or portion thereof. The 
pharmacocluster is determined for a. ligand bound to 
different polypeptides but does not require that a structure 
of the polypeptide be known or included as part of a bound 
conformation of a ligand. A bound conformation of a ligand 
can include the entire ligand structure or selected atoms 
including a portion of the complete atomic composition of 
the ligand so long as the number of atoms provides 



P 



sufficient information to distinguish one pharmacocluster 
from another. A pharmacocluster can include both the bound 
conformations of a ligand, or portion thereof, and one or 
more atoms that both interact with the ligand and are from a 
5 bound polypeptide. Thus, a pharmacocluster can include 
conformational information of 1 or more, 2 or more, 5 or 
more, 10 or more, 2 0 or more, 3 0 or more, 4 0 or more, 50 or 
more or 100 or more atoms of a ligand bound conformation. 

Accordingly, portions of bound conformations of 

10 two or more different ligands can be Included in =a ligand 
pharmacocluster so long as the portions selected from each 
ligand have a core bound conformation that is substantially 
the same. A core bound conformation can consist of portions 
of bound conformations of ligands wherein the portions have 

15 identical structural formula and conformation. A core bound 
conformation can also consist of portions of bound 
conformations of ligands wherein the portions have different 
structural formulas so long as the portions have 
substantially the same conformation. The structural 

20 formula, as it is understood in the art, is a 2 dimensional 
representation of a molecule that identifies the atoms and 
covalent bonds between each atom in the molecule. The 
structural formula does not necessarily include information 
sufficient to determine conformation of a molecule. For 

25 example, a common structural formula representation of 

cyclohexane can be a hexagon with 2 hydrogens attached to 
each carbon being in equivalent positions. However, a 
stable conformation of cyclohexane in solution may appear as 
a "chair" or "boat" shape with hydrogens in either axial or 

30 equitorial positions relative to the molecular plane. 
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As used herein, the term "conformation-dependent 
property," when used in reference to a ligand, refers to a 
characteristic of a ligand that specifically correlates with 
the three dimensional structure of a ligand or the 
5 orientation in space of selected atoms and bonds of the 

ligand. Thus, a ligand bound to a polypeptide in a distinct 
conformation will have at least one unique conformation- 
dependent property correlated with the bound conformation of 
the ligand. A conformation-dependent property can be 

10 derived from or include the. entire ligand structure or 

selected atoms and bonds, including a fragment or portion of 
the complete atomic composition of the ligand. A 
conformation-dependent property that includes selected atoms 
and bonds of a ligand can include 2 or more, 3 or more, 5 or 

15 more, 10 or more, 15 or more, 20 or more, 25 or more, or 50 
or more atoms of a bound conformation of a ligand. 

A characteristic that specifically correlates with 
a three dimensional structure of a ligand is a 
characteristic that is substantially different between at 

20 least two different bound conformations of the same ligand 
and, therefore, distinguishes the two different bound 
conformations. A conformation-dependent property can 
include a physical or chemical characteristic of a ligand, 
for example, absorption and emission of heat, absorption and 

25 emission of electromagnetic radiation, rotation of polarized 
light, magnetic moment, spin state of electrons, or 
polarity. A conformation-dependent property can also 
include a structural characteristic of a ligand based, for 
example, on an X-ray diffraction pattern or a nuclear 

3 0 magnetic resonance (NMR) spectrum. A conformation-dependent 
property can additionally include a characteristic based on 
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a structural model, for example, an electron density map, 
atomic coordinates, or x-ray structure, A conformation- 
dependent property can include a characteristic 
spectroscopic signal based on, for example, Raman, circular 
5 dichroism (CD) , optical rotation, electron paramagnetic 
resonance (EPR) , infrared (IR) , ultraviolet/visible 
absorbance (UV/Vis) , fluorescence, or luminescence 
spectroscopies. A conformation-dependent property can also 
include a characteristic NMR signal, for example, chemical 

10 shift, J coupling, dipolar coupling, cross -correlation, 
nuclear spin relaxation, transferred nuclear Overhauser 
effect, or combinations thereof. A conformation-dependent 
property can additionally include a thermodynamic or kinetic 
characteristic based on, for example, calorimetric 

15 measurement or binding affinity measurement. Furthermore, a 
conformation-dependent property can include characteristic 
based on electrical measurement, for example, voltammetry or 
conductance . 

As used herein, "selected" conformation-dependent 
20 properties are identified to form a set of conformation- 
dependent properties that can include, for example, the 
entire set of conformation-dependent properties associated 
with the bound conformations of a ligand in a 
pharmacocluster or a subset of conformation-dependent 
25 properties associated with the bound conformations of a 
ligand in a pharmacocluster, so long as the subset of 
conformation-dependent properties are sufficient to identify 
a unique conformation of the ligand. A selected 
conformation-dependent property can include any of the above 
3 0 described properties, for example, a physical or chemical 
property, structural data, a structural model, a 
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spectroscopic signal, a thermodynamic or kinetic measurement 
or an electrical measurement . 



As used herein, the term "bound conformation," 
when used in reference to a ligand, refers to the location 
5 of atoms of a ligand relative to each other in three 
dimensional space, where the ligand is bound to a 
polypeptide. The location of atoms in a ligand can be 
described, for example, according to bond angles, bond 
distances, relative locations of electron density, probable 
10 occupancy of atoms at points in space relative to each 

other, probable occupancy of electrons at points in space 
relative to each other or combinations thereof. 



As used herein, a "selected" bound conformation 
refers to a set of bound conformations that can include, for 
j* 15 example, the entire set of defined, bound conformations or a 

subset of bound conformations of a ligand. 
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As used herein, the term "clustering" refers to 
assigning related bound conformations of a ligand, or 
portion thereof, into a first collection such* that the 

20 conformations residing in the first collection can be 

overlaid with substantial overlap and bound conformations 
from two different collections cannot be overlaid with a 
better overlap than that resulting from members of the first 
collection. Exemplary clustering of ligand conformations 

25 are disclosed herein (see Example I) . 



As used herein, the term "ligand" refers to a 
molecule that can specifically bind to a polypeptide. 
Specific binding, as it is used herein, refers to binding 
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that is detectable over non-specific interactions by 
quantifiable assays well known in the art. A ligand can be 
essentially any type of natural or synthetic molecule 
including, for example, a polypeptide, nucleic acid, 
carbohydrate, lipid, amino acid, nucleotide or any organic 
derived compound. The term also encompasses a cof actor or a 
substrate of a polypeptide having enzymatic activity, or 
substrate that is inert to catalytic conversion by the bound 
polypeptide. Specific binding to a polypeptide can be due 
to covalent or non covalent interactions. 

As used herein, the term "bound to two or more 
polypeptides , " when used in reference to a ligand is 
intended to refer to two or more complexes consisting of a 
ligand and a polypeptide. A complex can include, for 
example, a single ligand bound to a single polypeptide. A 
complex can also include a single ligand bound to more than 
one polypeptides including, for example, a complex in which 
a ligand is bound at the interface of interacting 
polypeptides. A complex can also include multiple ligands, 
however, conformation dependent properties of all ligands of 
the complex need not be identified. A complex results from 
a specific interaction between a polypeptide and a ligand. 

As used herein, the term "substantially the same," 
when used in reference to bound conformations of a ligand, 
or portion thereof, is intended to refer to two or more 
bound conformations that can be overlaid upon each other in 
3 dimensional space such that all corresponding atoms 
between the two conformations are overlapped. Accordingly, 
"substantially different" bound conformations cannot be 
overlaid upon each other in 3 -dimensional space such that 
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all corresponding atoms between the two bound conformations 
are overlapped. 



As used herein, the term "polypeptide" is intended 
to refer to a peptide polymer of two or more amino acids. 
5 The term is similarly intended to include polymers 
containing amino acid sterioisomers , analogues and 
functional mimetics thereof. For example, derivatives can 
include chemical modifications of amino acids such as 
aikylation, acylation, carbamylation, iodination, or any 
TO modi fic ation which derivatives the polypeptide. Analogues 
can include modified amino acids, for example, 
hydroxyproline or carboxyglutamate , and can include amino 
acids, or analogs thereof, that are not linked by peptide 
bonds. Mimetics encompass chemicals containing chemical 
15 moieties that mimic the function of the polypeptide 

regardless of the predicted three-dimensional structure of 
the compound. For example, if a polypeptide contains two 
charged chemical moieties in a functional domain, a mimetic 

places two charged chemical moieties in a spatial 

I u 

H 2 0 orientation and constrained structure so that the 

«:? 

u corresponding charge is maintained in three-dimensional 

space. Thus, all of these modifications are included within 
the term "polypeptide" so long as the polypeptide retains 
its binding function. 

25 As used herein, the term "root mean square 

deviation," or RMSD, refers to a standard deviation which 
quantifies the structural variability in a population of 
bound conformations of a ligand. The term is intended to be 
consistent with its meaning as understood in the art as 

3 0 described for example in Doucet and Weber, Computer- Aided 
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Molecular Design: Theory and Applications , Academic Press, 
San Diego CA (1996) . 



far 



As used herein, the term "family," when used in 
reference to characterizing polypeptides having ligand 
5 binding activity, is intended to refer to polypeptides that 
can bind to the same ligand, or portion thereof. A 
polypeptide family can contain polypeptides having binding 
activity for a common ligand with sufficient affinity, 
avidity or specificity to allow measurement of the binding 
10 event. As defined herein a "member" of a polypeptide family 
refers to an individual polypeptide that can be classified 
in a polypeptide family because the polypeptide binds a 
ligand, or portion thereof, that binds another polypeptide 
in a polypeptide family. The bound conformations of a 
15 ligand bound by individual* members of a family can be 
l« substantially the same or different from each other. 



As used herein, the term "pharmacof amily , " when 
used in reference to polypeptides, is intended to refer to 
p polypeptides that can be classified together in a population 

^ 20 because they individually bind a ligand such that the ligand 

is bound in substantially the same conformation. As defined 
herein a "member" of a polypeptide pharmacof amily refers to 
an individual polypeptide that is classified in a 
polypeptide pharmacof amily because the polypeptide binds a 
25 conformation of a ligand that is substantially the same as a 
conformation of the ligand bound to another polypeptide in 
the pharmacof amily . 
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As used herein, the term "grouping" refers to 
assigning related polypeptides into a family or 
pharmacof amily such that the polypeptide members of a family 
bind the same ligand and the polypeptide members of a 
5 pharmacof amily bind substantially the same bound 
conformation of a ligand. 



As used herein, the term "fold," when used in 
reference to a polypeptide, refers to a specific geometric 
arrangement and connectivity of a combination of secondary 

10 structure elements in a polypeptide structure. Secondary 
structure elements of a polypeptide that can be arranged 
into a fold including, for example, alpha helices, beta 
sheets, turns and loops are well known in the art. Folds of 
a polypeptide can be recognized by one skilled in the art 

15 and are described in, for example, Branden and Tooze, 

Introduction to protein structure . Garland Publishing, New 
York (1991) and Richardson, Adv. Prot . Chem. 34:167-339 
(1981) . 



As used herein, "modeling the three dimensional 
20 structure" when used in reference to a polypeptide refers to 
determining a conformation for a polypeptide. A 
conformation of a polypeptide can be determined, for 
example, from empirical data specifying structure or from a 
compared conformation used as a template. A conformation 
25 can be determined at any desired level of resolution 

sufficient to identify, for example, overall shape of a 
polypeptide, tertiary structure elements, secondary 
structure elements, polypeptide backbone structure, amino 
acid residue identity or location of individual atoms. 
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As used herein, the term "structural model," when 
used in reference to a polypeptide, refers to a 
representation of a 3 dimensional structure of a 
polypeptide. A structural model can be determined from 
5 empirical data derived from, for example, X-ray 

crystallography or nuclear magnetic resonance spectroscopy. 
A structural model can also be derived from a theoretical 
calculation including, for example, comparison to a known 
structure or ab initio molecular modeling. A representation 
10 of a structural model can include, for example, an electron 
density map, atomic coordinates, x-ray structure model, ball 
and stick model, density map, space filling model, surface 

: "5 

has 

yp map, Connolly surface, Van der Waals surface or CPK model. 

N 

'%.} As used herein, the term "conformer model" refers 

f"! 15. to a representation of points in a defined coordinate system 

4* wherein a point corresponds to a position of an atom in a 

j\ bound conformation of a ligand. The coordinate system is 

fy preferably in 3 dimensions, however, manipulation or 

si ; 

; j I t 

|Jg computation of a model can be performed in 2 dimensions or 

!Ij 20 even 4 or more dimensions in cases where such methods are 

** preferred. A point in the representation of points can, for 

example, correlate with the center of an atom. 
Additionally, a point in the representation of points can be 
incorporated into a line, plane or sphere to include a shape 

2 5 of one or more atom or volume occupied by one or more atom. 

A conformer model can be derived from 2 or more bound 
conformations of a ligand. For example a conformer model 
can be generated from 3 or more, 4 or more, 5 or more, 6 or 
more, 7 or more, 8 or more, 10 or more, 15 or more, 20 or 

3 0 more or 25 or more bound conformations of a ligand. 
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As used herein, the term "average structure," when 
used in reference to bound conformations of a ligand in a 
pharmacocluster , refers to conformer model, derived by 
superimposing the bound conformations of a ligand in a 
pharmacocluster, and determining an average location in 
space for corresponding atoms . 

As used herein, the term "pharmacophore model" 
refers to a representation of points in a defined coordinate 
system wherein a point corresponds to a position or other 
characteristic of an atom or chemical moiety in a bound = 
conformation of a ligand and/or an interacting polypeptide 
or ordered water. An ordered water is an observable water 
in a model derived from structural determination of a 
polypeptide. A pharmacophore model can include, for 
example, atoms of a bound conformation of a ligand, or 
portion thereof. A pharmacophore model can include both the 
bound conformations of a ligand, or. portion thereof, and one 
or more atoms that both interact with the ligand and are 
from a bound polypeptide. Thus, in addition to geometric 
characteristics of a bound conformation of a ligand, a 
pharmacophore model can indicate other characteristics 
including, for example, charge or hydrophobicity of an atom 
or chemical moiety. A pharmacaphore model can incorporate 
internal interactions within the bound conformation of a 
ligand or interactions between a bound conformation of a 
ligand and a polypeptide or other receptor including, for 
example, van der Waals interactions, hydrogen bonds, ionic 
bonds, and hydrophobic interactions. A pharmacophore model 
can be derived from 2 or more bound conformations of a 
ligand. For example a conformer model can be generated from 
3 or more, 4 or more, 5 or more, 6 or more, 7 or more, 8 or 
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more, 10 or more, 15 or more, 2 0 or more or 2 5 or more bound 
conformations of a ligand. 



A point in a pharmacophore model can, for example, 
correlate with the center of an atom or moiety. 
5 Additionally, a point in the representation of points can be 
incorporated into a line, plane or sphere to indicate a 
characteristic other than a center of an atom or moiety 
including, for example, shape of an atom or moiety or volume 
occupied by an atom or moiety. The coordinate system of a 

10 pharmacophore model is preferably in 3 dimensions, however, 
manipulation or computation of a model can be performed in 2 
dimensions or even 4 or more dimensions in cases where such 
methods are preferred. Multidimensional coordinate systems 
in which a pharmacophore model can be represented include, 

15 for example, cartesian coordinate systems, fractional 
coordinate systems, or reciprocal space. The term 
pharmacophore model is intended to encompass a conformer 
model . 

As used herein, the term "moiety" refers to a 
2 0 group of atoms that form a part or portion of a larger 

molecule. A moiety can consist of any number of atoms in a 
portion of a ligand and can correlate with a physical or 
chemical property conferred upon the ligand by the combined 
atoms. Exemplary moieties of a nicotinamide adenine 
25 dinucleotide ligand include a phosphate, nicotinamide ring, 
amino group, amide group or ribose ring. In addition, a 
nicotinamide adenine dinucleotide group can be a moiety. 
For example, a nicotinamide adenine dinucleotide can be a 
moiety of the 2'P phosphate in a nicotinamide adenine 
30 dinucleotide phosphate molecule (see Figure 2 for location 
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of the 2*P phosphate in nicotinamide adenine dinucleotide 
phosphate) . 

The invention provides a method for identifying a 
pharmacocluster . The method includes the steps of (a) 
determining bound conformations of a ligand bound to 
different polypeptides, and (b) clustering two or more bound 
conformations of the ligand having substantially the same 
bound conformation, thereby identifying a pharmacocluster. 
The invention also provides a method for identifying a 
member of a pharmacocluster. The method includes the steps 
of (a) determining a bound conformation of a ligand bound to 
a polypeptide; and (b) determining a pharmacocluster having 
substantially the same bound conformation as the bound 
conformation, thereby identifying the bound conformation of 
the ligand as a member of the pharmacocluster. 

A bound conformation of a ligand bound to a 
polypeptide can be determined from a previously observed 
molecular structure or from data specifying a molecular 
structure for a bound conformation of a ligand. Previously 
observed structures can be acquired for use in the invention 
by searching a database of existing structures. An example 
of a database that includes structures of bound 
conformations of ligands bound to polypeptides is the 
Protein Data Bank (PDB, operated by the Research 
Collaboratory for Structural Bioinf ormatics , see Berman et 
al . , Nucleic Acids Research , 28:235-242 (2000)). A database 
can be searched, for example, by querying based on chemical 
property information or on structural information. In the 
latter approach, an algorithm based on finding a match to a 
template can be used as described, for example, in Martin, 



"Database Searching in Drug Design," J. Med. Chem. 35:2145- 
2154 (1992) . 

A bound conformation of a ligand bound to a 
polypeptide can be determined from an empirical measurement, 
or from a database. Data specifying a structure can be 
acquired using any method available in the art for 
structural determination of a ligand bound to a polypeptide. 
For example, X-ray crystallography can be performed with a 
crystallized complex of a polypeptide and ligand to 
determine a bound conformation of the ligand bound to the 
polypeptide. Methods for obtaining such crystal complexes 
and determining structures from them are well known in the 
art as described for example in McRee et al . , Practical 
Protein Crystallography , Academic Press, San Diego 1993; 
Stout and Jensen, X-ray Structure Determination: A practical 
guide . 2 nd Ed. Wiley, New York (1989); and McPherson, The 
Preparation and Analysis of Protein Crystals , Wiley, New 
York (1982) . Another method useful for determining a bound 
conformation of a ligand bound to a polypeptide is Nuclear 
Magnetic Resonance (NMR) . NMR methods are well known in the 
art and include those described for example in Reid, Protein 
NMR Techniques , Humana Press, Totowa NJ (1997) ; and 
Cavanaugh et al . , Protein NMR Spectroscopy: Principles and 
Practice , ch. 7, Academic Press, San Diego CA (1996) . 

A bound conformation of a ligand can also be 
determined from a hypothetical model. For example, a 
hypothetical model of a bound conformation of a ligand can 
be produced using an algorithm which docks a ligand to a 
polypeptide of known structure and fits the ligand to the 
polypeptide binding site. Algorithms available in the art 
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for fitting a ligand structure to a polypeptide binding site 
include, for example, DOCK (Kuntz et al . , J. Mol . Biol. 
161:269-288 (1982)) and INSIGHT98 (Molecular Simulations 
Inc., San Diego, CA) . 



5 A molecular structure can be conveniently stored 

and manipulated using structural coordinates. Structural 
coordinates can occur in any format known in the art so long 
as the format can provide an accurate reproduction of the 
observed structure. For example, crystal coordinates can 
10 occur in a variety of file types including, for example, 

.fin, . df, .phs, or .pdb as described for example in McRee, 
supra. Although the examples above describe structural 
coordinates derived from X-ray crystallographic analysis or 
NMR spectroscopy, one skilled in the art will recognize that 
15 structural coordinates can be derived from any method known 
in the art to determine a bound conformation of a ligand 
5 bound to a polypeptide. 
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Structures at atomic level resolution can be 
useful in the methods of the invention. Resolution, when 

20 used to describe molecular structures, refers to the minimum 
distance that can be resolved in the observed structure. 
Thus, resolution where individual atoms can be resolved is 
referred to in the art as atomic resolution. Resolution is 
commonly reported as a numerical value in units of Angstroms 

25 (A, 10" 10 meter) correlated with the minimum distance which 
can be resolved such that smaller values indicate higher 
resolution. Bound conformations of a ligand useful in the 
methods of the invention can have a resolution better than 
about 10 A, 5 A, 3 A, 2 . 5 A, 2.0 A, 1.5 A, 1.0 A, 0.8 A, 0.6 

30 A, 0.4 A, or about 0 . 2 A or better. Resolution can also be 
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reported as an all atom RMSD as used, for example, in 
reporting NMR data. Bound conformations of a ligand useful 
in the methods of the invention can have an all atom RMSD 
better than about 10 A, 5 A, 3 A, 2.5 A, 2.0 A, 1.5 A, 1.0 
5 A, 0.8 A, 0.6 A, 0.4 A, or about- 0 . 2 A or better. 

An advantage of the methods of the invention is 
that a structure of a polypeptide bound to a bound 
conformation of a ligand need not be determined to identify 
a pharmacociuster . Thus, methods that detect only the 

10 structure. of the ligand can be used in' the invention. In 

some cases determination or refinement of only the structure 
of the ligand in a polypeptide -ligand complex will be 
required. In addition, methods that detect a conformation- 
dependent property of the ligand can be used to identify a 

15 pharmacociuster. Methods that can be used to determine a 
conformation-dependent property of a ligand in a 
polypeptide -ligand complex without determining the structure 
of the polypeptide include, for example, Electron Nuclear 
Double Resonance spectroscopy (ENDOR, as described in Van 

20 Doorslaer and Schweiger, Naturwissenschaf ten 87:245- 

55(2000)), Electron Paramagnetic Resonance spectroscopy 
(EPR, described in Cantor and Schimmel Biophysical 
Chemistry, Part I: The conformation of biological 
macromolecules W. H. Freeman and Company (1980)), chemically 

25 induced dynamic nuclear polarization (CIDNP, described in 
Siebert et al., Glycoconi J. 14 : 945-9 (1997) and Consonni et 
al., FEBS Lett. 372:135-9 (1995)), solid state NMR 
(described in Mehring, M. High Resolution NMR spectroscopy 
in Solids , 2 nd ed. Springer-Verlag, Berlin (1983) and liquid 

30 phase NMR (described in Wiithrich, NMR of Proteins and 

Nucleic Acids John Wiley & Sons, Inc. (1986)). Thus, the 
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invention can be performed in a manner whereby the time and 
cost associated with a full determination of a polypeptide 
structure is avoided. 

5 Any representation that correlates with the 

structure of a bound conformation of a ligand can be used in 
the methods of the invention. For example, a convenient and 
commonly used representation is a displayed image of the 
structure. Displayed images that are particularly useful 

10 for determining the bound conformation of a ligand bound to 
polypeptides include, _for example, ball _and .stick, models, - 
density maps, space filling models, surface map, Connolly 
surfaces, Van der Waals surfaces or CPK model. Display of 
SI images as a computer output, for example, on a video screen 

15 can be advantageous as described below. 

j» Clustering can be performed with any ligand or any 

= number of bound conformations of a ligand. The methods of 

i : 

the invention can be performed by clustering 2 or more bound 
fli conformations of a ligand. For example, clustering can be 

ru 

p 2 0 performed with 3 or more, 4 or more, 5 or more, 6 or more, 7 

C3 or more, 8 or more, 9 or more, 10 or more, 11 or more, 12 or 

more, 13 or more, 14 or more, 15 or more or 2 0 or more bound 
conformations of a ligand. The methods of the invention can 
be used with any number bound conformations of a ligand. 
25 Due to the large sizes of data sets required to represent 

bound conformations of a ligand, methods of clustering bound 
conformations are generally performed on a computer. The 
methods are compatible with any computer that can support 
molecular modeling software including for example a personal 
30 computer, silicon graphics workstation, or supercomputer. A 
variety of computer software programs are available for 



molecular modeling including, for example, GRASP (Nicholls, 
A., supra), ALADDIN (Van Drie et al . supra), INSIGHT98 

(Molecular Simulations Inc., San Diego CA) , RASMOL (Sayle et 
al., Trends Biochem Sci. 20:374-376 (1995)) and MOLMOL 

(Koradi et al . , J . Mol . Graphics 14:51-55 (1996 )). 

Once a bound conformation of a ligand bound to 
different polypeptides has been determined, two or more 
bound conformations of the ligand can be compared and those 
having substantially the same bound conformation can be 
clustered. Methods of comparison include, for example, a 
method that provides alignment of two or more bound 
conformations of a ligand and evaluation of the degree of 
overlap in the two structures. Methods of comparison can be 
performed in an iterative fashion until a best fit is 
identified. 

Methods of comparing bound conformations of bound 
ligands include, for example, cluster analysis, visual ■ 
inspection and pairwise structural comparisons. Cluster 
analysis is commonly performed by, but not limited to, 
partitioning methods or hierarchical methods as described, 
for example, in Kauffman and Rousseeuw, Finding Groups in 
Data: An Introduction to Cluster Analysis , John Wiley and 
Sons Inc., New York (1990). Partitioning methods that can 
be used include, for example, partitioning around mediods, 
clustering large applications, and fuzzy analysis, as 
described in Kauffman and Rousseeuw, supra. Hierarchical 
methods useful in the invention include, for example, 
agglomerative nesting, divisive analysis, and monothetic 
analysis, as described in Kauffman and Rousseeuw, supra. 
Algorithms for cluster analysis of molecular structures are 
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known in the art and include, for example, COMPARE (Chiron 
Corp, 1995; distributed by Quantum Chemistry program 
Exchange, Indianapolis IN) . COMPARE can be used to make all 
possible pairwise comparisons between a set of conformations 
5 of the same ligand(s) . COMPARE reads PDB files and uses a 
Ferro-Hermanns ORIENT algorithm for a least squares root 
mean square (RMS) fit. The structures can be clustered into 
groups using the Jarvis- Patrick nearest neighbors algorithm. 
Based on the RMS deviation between ligand conformers, a list 

10 of 'nearest neighbors' for each conformer are generated. 
Two conformers are then grouped together or clustered if: 
(1) the RMS deviation is sufficiently small and (2) if both 
conformers share a determined number of common 'neighbors' . 
Both criteria are adjusted by the program to generate 

15 clusters based on a user defined cutoff for distance between 
individual clusters. Follow up analysis was conducted using 
Insightll to verify clusters. A member conformation is 
identified as being closer to the averaged coordinates of 
conformations within its family than to the averaged 

2 0 coordinates of any other family. 

Using methods such as those described above, one 
skilled in the art will know how to identify conformations 
that are substantially the same. For example, similarity 
can be evaluated according to the goodness of fit between 

25 two or more bound conformations of a ligand. Goodness of 
fit can be represented by a variety of parameters known in 
the art including, for example, the root mean square 
deviation (RMSD) . A lower RMSD between structures 
correlates with a better fit compared to a higher RMSD 

30 between structures. Bound conformations of a ligand having 
substantially the same conformations can be identified by 



comparing mean RMSD values within and between 
pharmacoclusters . Accordingly, bound conformations of a 
ligand having substantially the same conformations can have 
a mean RMSD compared to an average structure for the 
pharmacocluster that is less than 1.1 A. Two or more bound 
conformations of a ligand can be clustered by assigning 
bound conformations of a ligand into a collection such that 
the conformations of a ligand residing in the collection are 
substantially the same. Members of a pharmacocluster can 
also be identified as having RMSD values compared to an 
average structure for the pharmacocluster that are less -than 
1.0 A, 0.9 A, 0.8 A, 0.7 A, 0.6 A, 0.5 A, 0.4 A, 0.3 A, 0.2 
A or 0.1 A. 

A bound conformation of a ligand that is a member 
of a pharmacocluster can also be identified by comparing the 
RMSD for the bound conformation to an average conformation 
of the members in multiple pharmacoclusters. Using this 
value for comparison, a member conformation is identified as 
having a smaller RMSD when compared to the averaged 
coordinates of conformations within its family than when 
compared to the averaged coordinates of any other family. 
In addition, a member of a pharmacocluster can be identified 
as having an RMSD compared to an average conformation of the 
members in a pharmacocluster that is smaller than the RMSD 
between each family's average coordinates. For example, as 
described in Example I, RMSD values for members of 
pharmacoclusters 1-8 as presented in Tables 3A, 4A, 5A, 6A, 
7A, 8A, 9A or 10A, respectively, can be compared to RMSD 
values between each pharmacocluster as presented in Table 2 . 
Comparisons similar to those described above can be made for 
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bound conformations of any ligand according to the methods 
described in the Examples. 

In addition, bound conformations of a ligand can 
be compared with respect to dihedral angles at particular 
bonds. Exemplary methods for comparing dihedral angles 
between pharmacoclusters is described in Example I and Table 
1. Comparison between dihedral angles can be used, for 
example, in combination with overall RMSD comparisons such 
as those described above. Therefore, bound conformations 
that are not easily distinguished by comparison of overall - 
RMSD alone, can be distinguished according to the combined 
comparison of RMSD and dihedral angle. Bound conformations 
of a ligand that are members of different pharmacoclusters 
can have dihedral angles that differ, for example, by at 
least about 10 degrees, 30 degrees, 45 degrees, 90 degrees 
or 180 degrees. 

The invention also provides a pharmacocluster 
selected from the cluster consisting of pharmacocluster 1, 
pharmacocluster 2, pharmacocluster 3, pharmacocluster 4, 
pharmacocluster 5, pharmacocluster 6, pharmacocluster 7, and 
pharmacocluster 8 correlated with the pharmacof amilies 
listed in Table 11. 

Pharmacoclusters 1 through 8 contain bound 
conformations of NAD (P) (H) determined from structures 
deposited in the PDB for NAD (P) (H) bound to oxidoreductase 
polypeptides. Pharmacoclusters are shown in Figure 1 and 
described in further detail in Example I. The 
pharmacoclusters of Figure 1 display substantial overlap 
between bound conformations of NAD (P) (H) within the cluster, 
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as can be identified by visual inspection of the structures. 
Quantitative comparison of the bound conformations in each 
pharmacocluster demonstrates that each pharmacocluster 
displays less than about 1.1 A difference in RMSD between 
5 each conformation of NAD (P) (H) and the average bound 

conformation for the respective pharmacocluster as described 
in Example 1 . 

Pharmacoclusters can be used to identify a ligand 
having specificity for one or more polypeptide 

10 pharmacof amilies _ (see Example V) . As described herein ,_^a 
pharmacophore model or conformer model can be derived from 
one or more cluster. These models can be used to identify a 
ligand having specificity for one or more pharmacof amilies 
of oxidoreductases , for example, by using the model to query 

15 a database of molecules for a potential ligand or by using 
the model to guide in the design of a synthetic ligand. An 
example of using a pharmacophore of the invention to 
identify a binding compound is provided in Example VI. 

Pharmacoclusters, including, for example, 
2 0 pharmacoclusters 1 through 8 can also be used to identify a 
new polypeptide member of a polypeptide pharmacof ami ly. 
Using the methods described herein, for example, a 
pharmacocluster can be used to produce a pharmacophore model 
or conformer model to which a bound conformation of a ligand 
25 can be compared. A polypeptide bound to a bound 

conformation of a ligand that is similar to the model can be 
classified into an appropriate polypeptide pharmacof amily 
based on this comparison. By a similar method, a bound 
conformation of a ligand can be directly compared to a 
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pharmacocluster to classify the polypeptide bound to the 
conformation of a ligand into an appropriate pharmacof amily . 

The methods of the invention can also be used with 
a portion of a bound conformation of a ligand to identify a 
pharmacocluster. The method consists of (a) determining a 
bound conformation of a ligand, or portion thereof, bound to 
two or more polypeptides, and (b) clustering two or more 
bound conformations of the ligand, or portion thereof having 
substantially the same bound conformation, thereby 
identifying a. pharmacocluster . 

A bound conformation of a portion of a ligand can 
include selected atoms and/or bonds of a ligand and can 
include, for example, a continuous sequence of atoms and/or 
bonds or a discontinuous sequence of selected atoms and/or 
bonds that, when described independent of the complete 
ligand structure, may not appear to be attached to each 
other. Such a portion can include 2 or more atoms of a 
bound conformation of a ligand or 3 or more, 4 or more, 5 or 
more, 6 or more, 7 or more, 8 or more, 9 or more, 10 or 
more, 15 or more, 20 or more, 25 or more or 50 or more atoms 
of a bound conformation of a ligand. A bound conformation 
of a portion of a ligand bound to a polypeptide can be 
identified according to the same methods described above for 
identifying a bound conformation of a ligand bound to a 
polypeptide. Two or more bound conformations of a portion 
of a ligand can be clustered as described above so long as 
the bound conformations that are clustered correspond to 
bound portions of the ligand having the same structural 
formula. For example, in a case where determination of the 
complete structure of a ligand has not been achieved, a 
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complete structure of a ligand has not been achieved, a 
bound conformation of a portion of the ligand corresponding 
to the structurally determined portion can be used in the 
methods of the invention. 

5 A pharmacocluster can include portions of bound 

conformations derived from different ligands so long as the 
portions have a core bound conformation that is 
substantially the same. For example, portions having the 
same structural formula and bond configuration can share a 

10 coire_ bound conf ormation . The bond, conf iguration describes,- 
the relative position of atoms attached to a chiral atom of 
a ligand. Accordingly, R and S sterioisomers of a chiral 
atom have different bond configurations. Other terms used 
in the art to designate different bond configurations 

15 include, for example, cis and trans configurations of atoms 
attached to carbons that are double bonded, or Z and E 
configurations of atoms attached to carbons that are double 
bonded. An example of portions of ligands having the same 
structural formula and bond configuration that can share a 

20 core bound conformation are the nicotinamide adenine 

dinucleotide portions of nicotinamide adenine dinucleotide 
phosphate (NADP) and nicotinamide adenine dinucleotide 
(NAD) . Additionally, portions of ligands having different 
charge, atom substitution or bond hybridization can share a 

25 core bound conformation. An example of portions of ligands 
having different charge and bond hybridization that can 
share a core bound conformation are the nicotinamide adenine 
dinucleotide portions of oxidized nicotinamide adenine 
dinucleotide (NAD) 'and reduced nicotinamide adenine 

30 dinucleotide (NADH) . In cases where the core structures of 
two ligands bind with substantially the same conformation to 
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polypeptides, the core bound conformations can be clustered 
according to the methods of the invention (see Example I) . 



U 
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Substantially the same bound conformation of a 
portion of a bound conformation of a ligand, including non- 
5 continuous atoms, can be identified according to the root 
mean square deviation and compared directly. Conformations 
of portions having different numbers of atoms can also be 
compared via root mean square deviation per equivalent atom 
{RMSD/N, where N is the number of atoms compared) . A lower 
10 value of RMSD/N indicates increased similarity between .the 
two or more bound ligand conformations that are clustered. 
One skilled in the art will know that RMSD/N has a 
\j compensational origin and consideration of the effect of N 

J'j ' is required for comparison of RMSD/N between 

H 15 pharmacoclusters having different values of N. For example, 

? » the lower the value of RMSD/N the lower should be the value 

s of N to indicate substantial similarity. 

T IE? 

fy The invention can be used with any ligand for 

111 

~l which bound conformations of the ligand bound to different 

D 2 0 polypeptides can be determined including, for example, 

chemical or biological molecules such as simple or complex 
organic molecules, metal -containing compounds, 
carbohydrates , peptides , peptidomimetics , carbohydrates , 
lipids ,. nucleic acids, and the like. 



25 In one embodiment, the compositions and methods of 

the invention can be used with a ligand that is a nucleotide 
derivative including, for example, a nicotinamide adenine 
dinucleotide-related molecule. Nicotinamide adenine 
dinucleotide-related (NAD-related) molecules that can be 



used in the methods of the invention can be selected from 
the group consisting of oxidized nicotinamide adenine 
dinucleotide (NAD + ) , reduced nicotinamide adenine 
dinucleotide (NADH) , oxidized nicotinamide adenine 
dinucleotide phosphate (NADP + ) , and reduced nicotinamide 
adenine dinucleotide phosphate (NADPH) . An NAD-related 
molecule can also be a mimetic of the above- described 
molecules. Use of a NAD-related molecule to identify 
pharmacoclusters is- described in Example I. 

A mimetic is a molecule that has at least one 
function that is substantially the same as a function of a 
second molecule. A mimetic of a ligand can be identified 
according to its ability to bind to the same sites on a 
polypeptide as the ligand. For example, a mimetic can be 
identified by a binding competition assay using a ligand and 
a mimetic. The structure of a mimetic can be similar or 
different compared to the structure of the second molecule. 
The term can encompass molecules having portions similar to 
corresponding portions of the ligand in terms of structure 
or function. 

Examples of mimetics to the common ligand NADH, 
for example cibacron blue, are described in Dye -Ligand 
Chromatography , Amicon Corp., Lexington MA (1980).- Numerous 
other examples of NADH-mimics, including useful 
modifications to obtain such mimics, are described in Everse 
et al . (eds.), The Pyridine Nucleotide Coenzymes , Academic 
Press, New York NY (1982) . Particular analogs include 
nicotinamide 2-aminopurine dinucleotide, nicotinamide 8- 
azidoadenine dinucleotide, nicotinamide 1-deazapurine 
dinucleotide, 3-aminopyridine adenine dinucleotide, 3 -acetyl 



pyridine adenine dinucleotide, thiazole amide adenine 
dinucleotide, 3 -diazoacetylpyridine adenine dinucleotide and 
5-aminonicotinamide adenine dinucleotide. Particular 
mimetics can be identified and selected by ligand- 
displacement assays, for example using competitive binding 
assays with a known ligand as is well known in the art. 
Mimetic candidates can also be identified by searching 
databases of compounds for structural similarity with the 
common ligand or a mimetic. 

In another embodiment, the methods of the 
invention can be used with a ligand that is an adenosine 
phosphate-related molecule. Adenosine phosphate-related 
molecules can be selected from the group consisting of 
adenosine triphosphate (ATP) , adenosine diphosphate (ADP) , 
adenosine monophosphate (AMP) , and cyclic adenosine 
monophosphate (cAMP) . An adenosine phophate-related 
molecule can also be a mimetic of the above-described 
molecules. A mimetic of an adenosine phosphate-related 
molecule that can be used in the invention includes, for 
example, quercetin, adenylylimidodiphosphate (AMP-PNP) or 
olomoucine. 

A ligand useful in the methods of the invention 
can be a cofactor, coenzyme or vitamin including, for 
example, NAD, NADP, or ATP as described above. Other 
examples include thiamine (vitamin BJ , riboflavin (vitamin 
B 2 ) , pyridoximine (vitamin B 6 ) , cobalamin (vitamin B 12 ) , 
pyrophosphate, flavin adenine dinucleotide (FAD) , flavin 
mononucleotide (FMN) , pyridoxal phosphate, coenzyme A, 
ascorbate (vitamin C) , niacin, biotin, heme, porphyrin, 
folate, tetrahydrof olate, nucleotide such as guanosine 
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triphosphate, cytidine triphosphate, thymidine triphosphate, 
uridine triphosphate, retinol (vitamin A) , calciferol 

(vitamin D 2 ) , ubiquinone, ubiquitin, ot-tocopherol (vitamin 
E) , farnesyl, geranylgeranyl , pterin, pteridine or S- 
5 adenosyl methionine (SAM) . 

A polypeptide can be used as a ligand in the 
invention. For example, a ligand can be a naturally 
occurring polypeptide ligand such as a ubiquitin or 
polypeptide hormone including, for example, insulin, human 

10 growth hormone, thyrotropin releasing hormone, 

adrenocorticotropic hormone, parathyroid hormone, follicle 
stimulating hormone, thyroid stimulating hormone, 
luteinizing hormone, human chorionic gonadotropin, epidermal 
growth factor, nerve growth factor and the like. In 

15 addition a polypeptide ligand can be a non-naturally 
occurring polypeptide that has binding activity. Such 
polypeptide ligands can be identified, for example, by 
screening a synthetic polypeptide library such as a phage 
display library or combinatorial polypeptide library as 

2 0 described below. A polypeptide ligand can also contain 
amino acid analogs or derivatives such as those described 
below. Methods of isolation of a polypeptide ligand are 
well known in the art and are described, for example, in 
Scopes, Protein Purification: Principles and Practice , 3 rd 

25 Ed., Springer-Verlag, New York (1994); Duetscher, Methods in 
Enzymology , Vol 182, Academic Press, San Diego (1990); and 
Coligan et al . , Current protocols in Protein Science , John 
Wiley and Sons, Baltimore, MD (2000) . 



A nucleic acid can also be used as a ligand in the 
invention. Examples of nucleic acid ligands useful in the 
invention include DNA, such as genomic DNA or cDNA or RNA 
such as mRNA, ribosomal RNA or tRNA. A nucleic acid ligand 
can also be a synthetic oligonucleotide. Such ligands can 
be identified by screening a random oligonucleotide library 
for ligand binding activity, for example, as described 
below. Nucleic acid ligands can also be isolated from a 
natural source or produced in a recombinant system using 
well known methods in the art including, for example, those 
described in Sambrook et al . , Molecular Cloning - 
Laboratory Manual . 2nd ed. , Cold Spring Harbor Press, 
Plainview, New York (1989); Ausubel et al., Current 
Protocols in Molecular Biology (Supplement 47) , John Wiley & 
Sons, New York (1999) . 

A ligand used in the invention can be an amino 
acid, amino acid analog or derivatized amino acid. An amino 
acid ligand can be one of the 20 essential amino acids or 
any other amino acid isolated from a natural source. Amino 
acid analogs useful in the invention include, for example, 
neurotransmitters such as gamma amino butyric acid, 
serotonin, dopamine, .or norepenephrine or hormones such as 
thyroxine, epinephrine or melatonin. A synthetic amino 
acid, or analog thereof, can also be used in the invention. 
A synthetic amino acid can include chemical modifications of 
an amino acid such as alkylation, acylation, carbamylation, 
iodination, or any modification that derivatizes the amino 
acid. Such derivatized molecules include, for example, 
those molecules in which free amino groups have been 
derivatized to form amine hydrochlorides, p- toluene sulfonyl 
groups, carbobenzoxy groups, t-butyloxycarbonyl groups, 
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chloroacetyl groups or f ormyl groups . Free carboxyl groups 
can be derivatized to form salts, methyl and ethyl esters or 
other types of esters or hydrazides. Free hydroxyl groups 
can be derivatized to form O-acyl or O-alkyl derivatives. 
5 The imidazole nitrogen of histidine can be derivatized to 
form N-im-benzylhistidine . Naturally occurring amino acid 
derivatives of the twenty standard amino acids can also be 
included in a cluster of bound conformations including, for 
example , 4 - hydroxyproline , 5 -hydroxy lysine , 
10 3-methylhistidine, homoserine, ornithine or 
carboxyglutamate . ^ _ • 

A lipid ligand can also be used in the invention. 
Examples of lipid ligands include triglycerides, 
phospholipids, glycolipids or steroids. Steroids useful in 
15 the invention include, for example, glucocorticoids, 

mineralocorticoids, androgens, estrogens or progestins. 

Another type of ligand that can be used in the 
invention is a carbohydrate. A carbohydrate ligand can be a 
monosaccharide such as glucose, fructose, ribose, 
20 glyceraldehyde, or erythrose; a disaccharide such as 

lactose, sucrose, or maltose; oligosaccharide such as those 
recognized by lectins such as agglutinin, peanut lectin or 
phytohemagglutinin, or a polysaccharide such as cellulose, 
chitin, or glycogen. 

25 Methods for producing pluralities of compounds to 

use as ligands, including chemical or biological molecules 
such as simple or complex organic molecules, metal - 
containing compounds, carbohydrates, peptides, 
peptidomimetics , carbohydrates, lipids, nucleic acids, and 
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the like, are well known in the art (see, for example, in 
Huse, U.S. Patent No. 5,264,563; Francis et al . , Curr. Opin. 
Chem. Biol . 2:422-428 (1998); Tietze et al . , Curr. Biol . , 
2:363-371 (1998); Sofia, Mol . Divers . 3:75-94 (1998); 
5 Eichler et al . , Med. Res. Rev. 15:481-496 (1995); Gordon et 
al., J. Med. Chem. 37: 1233-1251 (1994); Gordon et al., 
Med. Chem. 37: 1385-1401 (1994); Gordon et al., Acc . Chem. 
Res . 29:144-154 (1996); Wilson and Czarnik, eds . , 
Combinatorial Chemistry: Synthesis and Application , John 

10 Wiley & Sons, New York (1997), Gold et al., U.S. Pat Nos . 
5,475,096 (1995) , 5, 789, 157 (1998) , and 5,270,163 (1.9.93.)-)- . 
The advantage of using such a combinatorial library is that 
molecules do not have to be individually generated to 
identify a ligand that binds a polypeptide. Also, no prior 

15 knowledge of the exact characteristics of a binding 

polypeptide is required when using a combinatorial library. 
Libraries containing large numbers of natural and synthetic 
compounds also can be individually synthesized or obtained 
from commercial sources. 



20 In addition, the invention provides a method for 

identifying a conformation-dependent property of a ligand. 
The method includes the steps of (a) determining bound 
conformations of a ligand bound to different polypeptides; 

(b) identifying two or more bound conformations of the 

25 ligand having substantially the same bound conformation, and 

(c) identifying a conformation-dependent property of the 
bound conformations of the ligand having substantially the 
same bound conformation, the conformation-dependent property 
being correlated with the bound conformation of the ligand. 
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A conformation-dependent property can be 
identified as any property that correlates with a bound 
conformation of a ligand such that a change in the bound 
conformation results in a change in the conf ormation- 
5 dependent property. Accordingly, a bound conformation of a 
ligand, or a portion thereof, can be a conformation- 
dependent property. A portion of a bound conformation of a 
ligand can be a contiguous fragment or a non- contiguous set 
of atoms or bonds. A bound conformation of a ligand, or 
10 portion thereof, can -be identified by any method for 

determining .the three dimensional -structure of a ligand 
including as disclosed herein. 



Other conf ormat ion -dependent properties include, 
for example, absorption and emission of heat, absorption and 
15 emission of electromagnetic radiation, rotation of polarized 
light, magnetic moment, spin state of electrons, or 
polarity, as disclosed herein, or other properties that can 
be identified as a spectroscopic signal. Methods known in 
the art for measuring changes in absorption and emission of 
20 heat that correlate with changes in bound conformation of a 
C3 ligand include, for example, calorimetry. Methods known in 

the art for measuring changes in absorption and emission of 
electromagnetic radiation as they correlate with changes in 
bound conformation of a ligand include, for example, UV/VIS 
25 spectroscopy, fluorimetry, luminometry, infrared 
spectroscopy, Raman spectroscopy, resonance Raman 
spectroscopy, X-ray absorption fine structure spectroscopy 
(XAFS) and the like. A change in a bound conformation of a 
ligand that is correlated with a change in rotation of 
30 polarized light can be measured with circular dichroism 

spectroscopy or optical rotation spectroscopy. A change in 
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magnetic moment or spin state of an electron that correlates 
with a change in a bound conformation can be measured, for 
example, with Electron paramagnetic resonance spectroscopy 
(EPR) or nuclear magnetic resonance spectroscopy (NMR) . 

5 When based on NMR data, a conformation-dependent 

property can be identified as an NMR signal including, for 
example, chemical shift, J coupling, dipolar coupling, 
cross-correlation, nuclear spin relaxation, transferred 
nuclear Overhauser effect, and any combination thereof. A 

10 conformation-dependent property can be identified by NMR 
methods in both fast and slow exchange regimes. For 
example, in many cases, the exchange rate of a complex 
between ligand and polypeptide is faster than the ligand 
spin relaxation rate (l/T 1H ) . * In this situation, referred 

15 to as the "fast exchange regime," transferred nuclear 
Overhauser effect (NOE) experiments can be performed to 
measure an intra-ligand proton-proton distance (Wuthrich, 
NMR of proteins and Nucleic Acids , Wiley, New York (1986) 
and Gronenborn, J. Maan. Res. 53:423-442 (1983).). Labeling 

20 of polypeptides is not required, and the ligand polypeptide 
concentration ratio can be adjusted to minimize line 
broadening of the ligand resonances while retaining strong 
NOE contribution from the bound form. 

In a fast exchange regime, cross-correlated 
25 relaxation measurements can also provide structural 

information on ligand torsion angles (Carlomagno et al., J. 
Am. Chem Soc . 121:1945-1948 (1999)). These measurements 
include the dipole-dipole cross-correlation but can be 

extended to other cross-correlated relaxation mechanisms 
3 0 involving also homo- and heteronuclear chemical shielding 
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anisotropy relaxation, as well as quadrupolar relaxation. 
For most of these heteronuclear experiments, the natural 
abundance of the isotope can be exploited. In cases where 
natural abundance of the isotope measured is not sufficient, 
5 isotope enriched ligands can be obtained from commercial 

sources such as Isotek (Miamisburg, OH) or Cambridge Isotope 
Laboratories (Andover, MA) or prepared by methods known in 
the art. Another method to determine a conformation- 
dependent property of a ligand in a fast exchange regime is 
10 use of residual homo- and heteronuclear dipolar couplings in 
partially aligned samples (Tolman et al . Proc. Natl. Acad. 
Sci. USA 92:9279-9283 (1995)). 

a 

In the slow exchange regime, the NMR signals 
arising from the bound conformation of the ligand are 
distinguished from those of the polypeptide to reduce 
resonance overlap. This can be achieved with different 
isotope labeling schemes of polypeptide, ligand or both. 
For large systems, perdeuteration of macromolecules and 
TROSY-type experiments (Pervushkin, Proc. Natl. Acad. Sci. 
USA 94:12366-12371 (1997)) can be used to minimize signal 
losses due to fast transverse relaxation of the resonances 
of the complex. With the appropriate sample requirements 
and isotope filtered experiments, cross-correlations, cross- 
relaxations and residual dipolar couplings can be measured 
and provide necessary structural information. 
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In addition, homo- and heteronuclear two and three 
bond J couplings can be obtained to provide information on 
torsion angles (Wuthrich, supra) . For example, as shown in 
Table 1 the bound conformations of NADP in pharmacocluster 4 
30 and pharmacocluster 5 differ by a torsion angle defined by 



the atoms PN-05 1 N-C5 * N-C4 ■ N (See Figure 2 for atom labeling 
and bond location) . Specifically, pharmacocluster 4 has a 
PN-05 1 N-C5 1 N-C4 ' N torsion angle of 145 degrees and 
pharmacocluster 5 has a PN-05 ' N-C5 ' N-C4 1 N angle of -112 
degrees. These torsion angles can be measured and 
distinguished by measuring the three bond 31 P- 13 C4 ■ J 
coupling constants that correspond to this torsion angle 
(Marino, Acc . Chem. Res. 32:614-623 (1999)). Basically, two 
1 H- 13 C correlation experiments can be performed with and 
without 31 P decoupling during 13 C evolution. The intensity 
ratio of the X H 4 1 / 13 C4 ' cross peak f rom each experiment is 
proportional to the 31 P- 13 C4 ! J coupling constant. 

Correlation of a conformation-dependent property 
with a bound conformation of a ligand can be achieved by any 
method that has sufficient sensitivity to detect changes 
that correlate with changes in bound conformation of a 
ligand. Such a correlation can be determined by measuring a 
conformation-dependent property for various conformations of 
a ligand and determining the extent of change in the signal 
with change in the conformation. Signal changes that 
correlate with changes in conformation and that are 
detectable with a signal to noise ratio accepted in the art - 
as significant can be used in the invention. 

Correlation between a conformation-dependent 
property and a conformation can be determined for a ligand 
bound to any partner so long as binding is specific and 
stable. For example, for purposes of establishing a 
correlation, changes in a conformation dependent property 
that correlate with changes in bound conformation of a 
ligand can be determined for a ligand bound to polypeptides 



from different polypeptide pharmacof amilies . A bound 
conformation of the ligand in each complex can be determined 
and a conformation-dependent property can be measured for 
each complex. Comparison of bound conformations of the 
ligand in each complex with a measured conformation- 
dependent property can be used -to establish a correlation. 
Demonstration of a method for establishing a correlation 
between an NMR signal and bound conformations of a ligand is 
described herein (see Example IV) . Other methods for 
correlating spectroscopic signals with bound conformations 
of a ligand are known in the art including, _ for ejcample, 
correlation of transferred NOE signals with anti and syn 
conformations of the nicotinamide ring in NADPH as described 
in Sem and Kasper Biochemistry 31:3391-3398 (1992) . 
Correlation of transferred NOE signals with conformation is 
also described in Clore and Gronenborn, J. -Magn. Reson. 
48 :402-417 (1982) . 

A correlation between a bound conformation and a 
conformation-dependent property can also be established for 
a ligand bound to a non-polypeptide binding partner because 
a conformation-dependent property of a ligand can be 
independent of interactions that differ between binding 
partners so long as the ligand is in the same bound 
conformation when bound to the binding partners. Other 
binding partners include, for example, nucleic acids, 
carbohydrates, and synthetic organometallic complexes. 

A method of the invention for identifying a 
conformation-dependent property of a ligand can also include 
the steps of (a) determining a bound conformation of a 
ligand, or portion thereof, bound to two or more 
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polypeptides; (b) identifying two or more bound 
conformations of the ligand, or portion thereof, having 
substantially the same bound conformation, and (c) 
identifying a conformation-dependent property of the bound 
5 conformations of the ligand, or portion thereof, having 

substantially the same bound conformation, the conformation- 
dependent property being correlated with the bound 
conformation of the ligand, or portion thereof. A 
conformation- dependent property of a portion of a ligand can 
10 be identified, for example, by using the methods described 
above for identifying a conformation-dependent property of a 
ligand. 



The invention also provides a method for 
identifying a polypeptide pharmacof amily . The method 
15 includes the steps of (a) determining bound conformations of 
a ligand bound to different polypeptides of a polypeptide 
family, and (b) identifying two or more bound conformations 
j=n of the ligand having substantially different bound 

* is? 

FU conformations, thereby identifying at least two polypeptide 

m 

~ 20 pharmacof amilies exhibiting binding specificity for the two 

D or more substantially different bound conformations of the 

ligand . 



A method for identifying a polypeptide 
pharmacof amily can include the steps of (a) determining 

25 bound conformations of a ligand bound to different 

polypeptides of a polypeptide family; (b) clustering bound 
conformations of a ligand having substantially the same 
conformations into pharmacoclusters ; and (c) identifying a 
first polypeptide that binds a bound conformation of a 

30 ligand in one pharmacocluster and a second polypeptide that 



binds a bound conformation of a ligand in a second 
pharmacocluster as belonging to separate polypeptide 
pharmacof amilies . 

Polypeptides of a polypeptide family can be 
identified by their ability to specifically bind to the same 
ligand, or portion thereof. Specific binding between a 
polypeptide and a ligand can be identified by methods known 
in the art. Methods of determining specific binding 
include, for example, equilibrium binding analysis, 
competition assays, and kinetic assays as described in 
Segel, Enzyme Kinetics John Wiley and Sons, New York (1975), 
and Kyte, Mechanism in Protein Chemistry Garland Pub. 
(1995) . Thermodynamic and kinetic constants can be used to 
identify and compare polypeptides and ligands that 
specifically bind each other and include, for example, 
dissociation constant (K d ) , association constant (K a ) , 
Michaelis constant (Kj , inhibitor dissociation constant 
(K is ) association rate constant (k on ) or dissociation rate 
constant (k off ) . For example, a family can be identified as 
having members that can specifically bind a ligand with a K d 
of* at most 1CT 3 M, 10" 4 M, 10 -5 M, 10' 6 M, 10" 7 M, 10" 8 M, 10" 9 
M, 10" 10 M, 10" 11 M, or 10" 12 M or lower. 

A family of polypeptides that bind a ligand can 
contain a pharmacof amily that binds substantially the same 
conformation of the ligand, or portion thereof. The methods 
can be used to identify any number of pharmacof amilies in a 
family according to the number of different bound 
conformations of a ligand identified. In cases where two or 
more polypeptide pharmacof amilies reside in a polypeptide 
family, the pharmacof amilies can be distinguished according 
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to differences in bound conformations of a ligand bound to 
the polypeptides. In this case, a bound conformation of a 
ligand can be determined and compared according to the 
methods described herein. Polypeptides bound to different 
5 bound conformations of a ligand can be identified as those 
that do not show substantial overlap of all corresponding 
atoms when bound conformations are overlaid. Thus, 
polypeptides that bind different bound conformations of a 
ligand can be separated into different pharmacof amilies . 
10 Pharmacof amilies in turn can be identified as containing 
polypeptides that bind substantially the same bound ._. _ 
conformation of a ligand (see Examples II and III) . 

A pharmacof ami ly of polypeptides identified by the 
methods of the invention can have additional similarities 

15 that correlate with similarities in bound conformation of a 
ligand. For example, a polypeptide pharmacof amily 
identified by the methods of the invention can consist of 
polypeptide members that share characteristics that are 
unique to the pharmacof amily when compared to one or more 

20 other polypeptides in a different pharmacof amily of the same 
family. Such characteristics can include, for example, 
protein fold, evolutionary relatedness, enzymatic activity, 
domain structure, subcellular localization, interaction 
partners, or participation in a similar metabolic or signal 

25 transduction pathway. A demonstration of a correlation 

between ligand bound conformation and another characteristic 
of polypeptides in a pharmacof amily is provided in Example 
II, which describes correlation of bound conformation of a 
ligand with polypeptide structure. 
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An example of a polypeptide family having multiple 
pharmacof amilies that can be identified by the methods of 
the invention includes NAD (P) (H) binding polypeptides. 
Polypeptide pharmacof amilies identified according to 
5 differences in bound conformations of NAD(P) (H) are 

described in Example II and Table 11. Thus, the methods can 
be used to identify a polypeptide pharmacof ami ly selected 
from the group consisting of pharmacof ami ly 1, 
pharmacof amily 2, pharmacof amily 3, pharmacof amily 4, 
10 pharmacof amily 5, pharmacof amily 6, pharmacof amily 7, and 
pharmacof amily 8 ... _ _______ _ _ . . . 

The invention provides a polypeptide 
pharmacof amily, comprising polypeptides that bind to 
substantially the same bound conformation of a nicotinamide 
15 adenine dinucleotide-related molecule selected from 
pharmacof amily 1, pharmacof amily 2, pharmacof amily 3, 
pharmacof amily 4, pharmacof amily 5, pharmacof amily 6, 
pharmacof amily 7, and pharmacof amily 8 as listed in Table 
11. 

20 Pharmacof amilies 1 through 8 consist of the 

polypeptide members provided in Table 11 (see Example II) . 
The polypeptides in pharmacof amily 1 have the NAD (P) (H) 
binding Rossman fold in common, are all in the NAD (P) (H) 
binding Rossman SCOP Superfamily, and fall into the SCOP 

25 families of the amino-terminal do main of glyceraldeIiVd e-3 - 
phosphate dehydrogenase, the carboxy- terminal domain of 
alcohol/glucose dehydrogenase, the NAD binding domain of 
f ormate/glycerate dehydrogenase, the carboxy- terminal domain 
of amino acid dehydrogenase, or the amino- terminal domain of 

30 lactate & malate dehydrogenase. 
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The polypeptides in pharmacof amily 2 have the 
NAD (P) (H) binding Rossman fold in common, are all in the 
NAD (P) (H) binding Rossman SCOP Superf amily, and fall into 
the SCOP families of the carboxy-terminal domain of amino 
5 acid dehydrogenase, glyceraldehyde- 3 -phosphate 

dehydrogenase, and 6-phosphogluconate dehydrogenase. 

The polypeptides in pharmacof amily 3 have the 
NAD (P) (H) binding Rossman fold in common, are all in the 
NAD (P) (H) binding Rossman SCOP Superf amily, and fall into 
10 the tyrosine -dependent oxidoreductase SCOP family. 



The polypeptides in pharmacof amily 4 have the 
heme -linked catalase fold and are in the heme -linked 
catalase SCOP superfamily and heme-linked catalase SCOP 
family. 

The polypeptides in pharmacof amily 5 have the p-a 
TIM barrel fold in common, are all in the NAD (P) (H) linked 
oxidoreductase SCOP Superfamily, and fall into the aldo-keto 
reductase SCOP family. 

The polypeptides in pharmacof amily 6 are 
20 dihydrof olate reductases that all show the dihydrof olate 
reductase fold and fall into the dihydrof olate reductase 
SCOP superfamily and family. 

The polypeptides in pharmacof amily 7 have the 
FAD/NAD (P) (H) binding domain fold in common, are all in the 
25 FAD/NAD (P) (H) binding domain SCOP Superfamily, and fall into 
the the amino-terminal and central domains of FAD/NAD linked 
reductase SCOP family. 
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The polypeptides in pharmacof amily 8 have the 
ferrodoxin like fold in common, are all in the ferrodoxin 
like SCOP Superfamily, and fall into the NADPH-cytochrome 
P450 reductase or reductase SCOP families. 
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5 Polypeptide pharmacof amilies 1 through 8 were 

identified according to binding interactions with bound 
conformations of NAD (P) (H) in pharmacoclusters 1 through 8, 
as described in Example II. Accordingly, the invention 
provides a polypeptide pharmacof ami iy, comprising 
10 polypeptides that bind to a nicotinamide adenine 

dinucleotide-related molecule having a bound conformation 
selected from pharmacocluster 1, pharmacocluster 2, 
pharmacocluster 3, pharmacocluster 4, pharmacocluster 5, 
pharmacocluster 6, pharmacocluster 7, and pharmacocluster 8. 

15 

The invention additionally provides a method for 
identifying a member of a polypeptide pharmacof amily . The 
method consists of (a) determining a conformation-dependent 
property of a ligand bound to a polypeptide, and (b) 

2 0 determining a pharmacocluster having substantially the same 
conformation -dependent property as the conformation- 
dependent property determined for the bound ligand, wherein 
a polypeptide pharmacof amily binds the ligand in a 
conformation of the pharmacocluster, thereby identifying the 

25 polypeptide as a member of the polypeptide pharmacof amily . 
For example, the method can be used with a ligand such as a 
nicotinamide adenine dinucleotide-related molecule or 
adenosine phosphate-related molecule (see Examples II and 
III) . 
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The methods of the invention allow a new member of 
a polypeptide pharmacof amily to be identified based on 
correlation of a conformation-dependent property of a bound 
conformation of a ligand bound to a polypeptide with a 
conformation-dependent property established for a bound 
conformation of the ligand bound to another polypeptide in 
the same pharmacof amily . Thus, a classification can be made 
based on ligand structure without requiring determination of 
the bound conformation of the ligand. In one embodiment, 
the conformation-dependent property can be a model of a 
bound conformation. A bound conformation of a ligand bound 
to a test polypeptide can be determined, and the bound 
conformation can be compared to a pharmacocluster according 
to the methods described herein. Substantial overlap 
between the bound conformation of the ligand bound to the 
test polypeptide and another bound conformation of the 
ligand bound to a polypeptide in a pharmacof amily can be 
used to identify the test polypeptide as a member of that 
polypeptide pharmacof amily . 

In another embodiment, the conformation-dependent 
property can be a spectroscopic signal that is correlated 
with the conformation of a ligand. A spectroscopic signal 
can be measured for the ligand bound to a test polypeptide. 
The signal can be compared to a signal correlated with a 
bound conformation of a ligand bound to a polypeptide in a 
polypeptide pharmacof amily . Substantial similarity between 
the two signals indicates that the bound conformation of the 
ligand bound to the test polypeptide is substantially 
similar to the bound conformation of the ligand bound to the 
polypeptides of the pharmacof amily . Thus, the test 



polypeptide can be identified as a member of the polypeptide 
pharmacof ami ly . 

The invention provides rapid and efficient methods 
that can be used in a high- throughput screening format. 
High- throughput methods can be useful for identifying a 
member of a polypeptide pharmacof amily. In a case where a 
conformation-dependent property can be rapidly detected and 
processed, automated methods can be created for measuring 
samples in rapid succession or measuring multiple samples in 
parallel. Automated methods can be used for rapidly 
handling samples including, for example, robotic 
instruments. A combination of automated sample handling 
methods with detection of a conformation-dependent property 
can, therefore, be useful in a high- throughput screening 
method . 

According to the methods of the invention a 
compound can be identified that has greater specificity for 
the polypeptides of one pharmacof amily than for other 
polypeptides in the same family. Such a compound can be 
used to identify new members of a pharmacophore family using 
a binding assay. For example, a mimetic or analog of a 
ligand can be identified that preferentially adopts a 
conformation more similar to conformations in a particular 
pharmacocluster than those in other pharmacoclusters . Such 
a mimetic or analog can be used in a any binding assay 
capable of detecting interactions with a polypeptide, 
including, for example, high- throughput methods. 



A member of a polypeptide pharmacof amily can also 
be identified by searching a database of bound conformations 
of a ligand. For example, a bound conformation of a ligand 
that binds to a polypeptide of an identified pharmacof amily 
can be used as a query in a 3 dimensional search of a 
database containing bound conformations of a ligand. 
Overlap between the query conformation and a retrieved bound 
conformation of the ligand can be used to identify a 
polypeptide bound to the retrieved bound conformation of the 
ligand as a member of the same polypeptide pharmacof amily as 
a polypeptide that binds the query bound conformation .(.see 
Example I) . 

The invention also provides a method of modeling 
the three dimensional structure of a polypeptide. The 
method consists of (a) determining a conformation-dependent 
property of a ligand bound to a polypeptide; (b) determining 
a pharmacocluster having substantially the same 
conformation-dependent property as the conformation- 
dependent property determined for the bound ligand, wherein 
a polypeptide pharmacof amily binds the ligand in a 
conformation of the pharmacocluster, thereby identifying the 
polypeptide as a member of the polypeptide pharmacof amily , 
and (c) modeling the three dimensional structure of the 
polypeptide according to a structural model of the second 
member of the polypeptide pharmacof amily . 

As disclosed herein, polypeptides in a 
pharmacof amily can have similar characteristics including, 
for example, similar 3 dimensional structure. Therefore, 
the 3 dimensional structure of a polypeptide identified by 
the invention as a member of a pharmacof amily can be modeled 
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using a polypeptide that is in the same pharmacof amily and 
for which the structure is known. A variety of methods are 
known in the art for modeling the three dimensional 
structure of a polypeptide according to the amino acid 
5 sequence of the polypeptide and a structure of a second 
polypeptide used as a template. Available algorithms 
include, for example, GRASP (Nicholls, A., supra), ALADDIN 
(Van Drie et al . supra), INSIGHT98 (Molecular Simulations 
Inc., San Diego CA) , RASMOL (Sayle et al . , Trends Biochem 
10 Sci . 20:374-376 (1995)) and MOLMOL (Koradi et al . , J. Mol . 
Graphics 14:51-55 (1996 )). 

ifl A model of a polypeptide determined by the methods 

11 of the invention can be useful for identifying a function of 

S.j the polypeptide. For example, residues of a polypeptide 

[*. 15 that are involved in binding can be identified using a model 

4Z of the invention. Residues identified as participating in 

f. binding can be modified, for example, to engineer new 

Hj functions into a polypeptide, to reduce an intrinsic 
.activity of a polypeptide, or to enhance an intrinsic 

□ 2 0 activity of a polypeptide. In another example, a model of a 

pi 

*** polypeptide can be compared to other polypeptide structures 

to identify similar functions. Exemplary functions that can 
be identified from a polypeptide structure include binding 
interactions with other polypeptides and catalytic 
25 activities. 



The invention also provides a method for 
constructing a ligand conformer model by determining an 
average structure of the bound conformations of a ligand in 
a pharmacocluster . A method for constructing a ligand 
30 conformer model can include the steps of (a) determining 
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bound conformations of a ligand bound to different 
polypeptides; (b) clustering two or more bound conformations 
of the ligand having substantially the same bound 
conformation, thereby identifying a pharmacocluster, and (c) 
determining an average structure of the bound conformations 
of the ligand in the pharmacocluster. Additionally, a 
method for constructing a ligand conformer model can include 
the steps of (a) determining a bound conformation of a 
ligand bound to a polypeptide; (b) determining a 
pharmacocluster having substantially the same bound 
conformation as the bound conformation, thereby- identifying 
the bound conformation of the ligand as a member of the 
pharmacocluster, and (c) determining an average structure of 
the bound conformations of the ligand in the 
pharmacocluster . 

An average structure of the bound conformations of 
a ligand in a pharmacocluster can be determined by a variety 
of methods known in the art. For example, an average 
structure can be determined by overlaying bound 
conformations, or portions thereof, and identifying an 
average location for each atom. Bound conformations in a 
group to be averaged can be overlayed relative to a single 
member or relative to a centroid position for each atom. 
Algorithms for determining an average structure are known in 
the art and include for example the OVERLAY routine in 
INSIGHT98 (Molecular Simulations Inc., San Diego CA) . 

The format of a ligand conformer model can be 
chosen based on the method used to generate the model and 
the desired use of the model. In this regard, a conformer 
model can be represented as a single structure. The 
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resulting structure can be a unique structure compared to 
the conformations in the pharmacocluster from which it was 
derived. Thus, the conformer model can be a new structure 
never before observed in nature. A model represented by a 
5 single structure can be useful for making visual comparisons 
by overlaying other structures with the model . A conformer 
model can also be represented as a plurality of structures 
incorporating all or a subset of the bound conformations in 
the pharmacocluster. A model represented by multiple 
10 structures can be useful for identifying a range of minor 
deviations in the model.' - 



In yet another representation, the conformer model 
can be a volume surrounding all or a subset of the bound 
conformations in the pharmacocluster. A model showing 

15 volume can be useful for comparing other structures in a 
fitting format such that a structure which fits within the 
volume of the model can be identified as substantially 
similar to the model. One approach that can be used to fit 
a structure to a volume is comparison of equivalent surface 

20 patches using gnomonic projection as described for example 
in Chau and Dean, J. Mol. Graphics 7:130 (1989) . Use of a 
gnomonic projection to compare structures is also described 
in Doucet and Weber, Computer-Aided Molecular Design: Theory 
and Applications , Academic Press, San Diego CA (1996) . 

25 Algorithms which can be used to fit a structure to a volume 
are known in the art and include, for example, CATALYST 
(Molecular Simulations Inc., San Diego, CA) and THREEDOM 
which is a part of the INTERCHEM package which makes use of 
an Icosahedral Matching Algorithm (Bladon, J. Mol. Graphics 

30 7:130 (19S9) for the comparison and alignment of structures. 
An exemplary method of identifying a binding compound by 
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searching a database of structures using a gnomonic 
projection is provided in Example V. 

A conformer model can be useful in querying a 
database of polypeptide structures to find other members of 
a polypeptide pharmacof amily . For example, a member of a 
polypeptide pharmacof amily can be identified by querying a 
database of bound conformations of* a ligand to identify a 
retrieved bound conformation of a ligand that is 
substantially similar to the query structure, thereby 
identifying- a polypeptide bound to the retrieved- bound - 
conformation as a member of the same pharmacof amily as a 
polypeptide bound to the query bound conformation. A 
conformer model can also be used to identify a new member of 
a polypeptide pharmacof amily by querying a database of one 
or more polypeptide structures using an algorithm that docks 
the conformer model, wherein a favorable docking result with 
a retrieved polypeptide indicates that the retrieved 
polypeptide is a member of the same polypeptide 
pharmacof amily as a polypeptide bound to the bound 
conformation used as a query. In the latter mode, a 
potential new member of a pharmacof amily from which the 
conformer model was derived can be identified. The database 
queries described above can be performed with algorithms 
available in the art including, for example, THREEDOM and 
CATALYST . 

An advantage of the invention is that a conformer 
model can be used to identify a binding compound that is 
specific for polypeptides of a pharmacof amily . For example, 
the conformer model can be compared to a structure of a 
compound or to a bound conformation of a ligand to identify 



those having similar conformation. A conformer model can be 
further used to query a database of compounds to identify 
individual compounds having similar conformations. 

A conformer model of the invention can also be 
used to design a binding compound that is specific for 
polypeptides of one or more pharmacof amilies . The methods 
of the invention provide a conformer model that can be 
produced according to a cluster of bound conformations of a 
ligand that are specific for polypeptides of a 
pharmacof amily . A conformer model identified by these • 
criteria can be used as a scaffold structure for developing 
a compound having enhanced binding affinity or specificity 
for polypeptides of a pharmacof amily . Such a scaffold can 
also be used to design a combinatorial synthesis producing a 
library of compounds which can be screened for enhanced 
binding affinity for polypeptide members of a pharmacof amily 
or specificity for polypeptide members of one pharmacof amily 
compared to polypeptide members of another pharmacof amily . 
An algorithm can be used to design a binding compound based 
on a conformer model including, for example, LUDI as 
described by Bohm, J. Comput . Aided Mol . Pes. .6:61-78 
(1992) . 

A conformer model need not include all atoms of a 
pharmacocluster . Thus, a conformer model can include a 
portion of atoms in a pharmacocluster so long as the portion 
consists of contiguous atoms of a. bound conformation of a 
ligand and provides sufficient information to distinguish 
one pharmacocluster from another. Thus, a conformer model 
can be constructed by overlaying corresponding fragments of 
bound conformations of a ligand and obtaining an average 
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structure according to the methods described above. A 
conformer model made from a portion of a ligand can be 
advantageous due to its small size compared to a complete 
structure of the ligand from which it was derived. A 
5 conformer model based on a portion of a bound conformation 
of a ligand can also be used to more efficiently and rapidly 
query a database due to a reduced use of computer memory 
compared to the memory required to manipulate and store a 
structure containing all atoms of the ligand. 

10 ' The invention -provides a ligand conformer model,- 

selected from the group consisting of conformer model 1 
having coordinates listed in Table 3C, conformer model 2 
having coordinates listed in Table 4C, conformer model 3 
having coordinates listed in Table 5C, conformer model 4 

15 having coordinates listed in Table 6C, conformer model 5 
having coordinates listed in Table 7C, conformer model 6 
having coordinates listed in Table 8C, conformer model 7 
having coordinates listed in Table 9C, and conformer model 8 
having coordinates listed in Table 10C. Conformer models 1-8 

2 0 are average structures calculated from pharmacoclusters 1-8 
respectively. The conformer models were determined as 
described in Example III and are shown in Figure 4. 

The invention also provides moiety, having 
coordinates listed in Table 3C, coordinates listed in Table 
25 4C,. coordinates listed in Table 5C, coordinates listed in 

Table 6C, coordinates listed in Table 7C, coordinates listed 
in Table 8C, coordinates listed in Table 9C, or coordinates 
listed in Table 10C or subsets of the respective coordinate 
sets thereof. In one embodiment the moiety is not 



nicotinamide adenine dinucleotide or nicotinamide adenine 
dinucleotide phosphate. 

Additionally, the invention provides a method for 
constructing a pharmacophore model by constructing a model 
that contains one or more selected conformation-dependent 
properties of one or more pharmacoclusters . A method for 
constructing a pharmacophore model can include the steps of 
(a) determining bound conformations of a ligand bound to 
different polypeptides; (b) identifying two or more bound 
conformations of the ligand having substantially the same 
bound conformation; (c) identifying a conformation-dependent 
property of the bound conformations of the ligand having 
substantially the same bound conformation, the conformation- 
dependent property being correlated with the bound 
conformation of the ligand, and (d) constructing a model 
that contains one or more selected conformation-dependent 
properties of one or more pharmacoclusters. 

Additionally, a method for constructing a 
pharmacophore model can include the steps of (a) determining 
bound conformations of a ligand, or portion thereof, bound 
to different polypeptides; (b) clustering two or more bound 
conformations of the ligand, or portion thereof, having 
substantially the same bound conformation, thereby 
identifying a pharmacocluster , and (c) determining an 
average structure of the bound conformations of the ligand, 
or portion thereof, in the pharmacocluster, wherein the 
average structure is a pharmacophore model. A method for 
constructing a ligand conformer model can also include the 
steps of (a) determining a bound conformation of a ligand, 
or portion thereof, bound to a polypeptide; (b) determining 
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a pharmacocluster having substantially the same bound 
conformation as the bound conformation, thereby identifying 
the bound conformation of the ligand as a member of the 
pharmacocluster, and (c) determining an average structure of 
the bound conformations of the ligand in the 
pharmacocluster, wherein the average structure is a 
pharmacophore model. 

A pharmacophore model constructed by the methods 
of the invention can be derived from any conformation- 
dependent property that is correlated with a 

pharmacocluster. An example of a pharmacophore model useful 
in the methods of the invention is a conformer model. 
Additionally, a pharmacophore model can include a portion of 
a bound conformation, wherein the portion need not contain 
contiguous atoms of a bound conformation of a ligand so long 
as the pharmacophore model provides sufficient information 
to distinguish one pharmacocluster from another. Thus, a 
pharmacophore model can appear as points in space 
unconnected by any semblance of a covalent bond due to 
absence of intervening atoms. For example, a pharmacophore 
model constructed from a pharmacocluster of nicotinamide 
adenine dinucleotide bound conformations can contain a 
phosphate moiety and nicotinamide ring moiety absent the 
ribose moiety which intervenes in a complete model of the 
structure . 

A pharmacophore model can be any representation of 
points in a defined coordinate system that correspond to 
positions of atoms in a bound conformation of a ligand. For 
example ^ a point in a pharmacophore model can correlate with 
the center of an atom in a conformer model. An atom of a 
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conformer model can also be represented by a series of 
points forming a line, plane or sphere. A line, plane or 
sphere can form a geometric representation designating, for 
example, shape of one or more atoms or volume occupied by 
5 one or more atoms . 

A pharmacophore model can be represented in any 
coordinate system including, for example, a 2 dimensional 
Cartesian coordinate system or 3 dimensional Cartesian 
coordinate system. Other coordinate systems that can be 
10 "used include a fractional coordinate system or- reciprocal 
space such as those used in crystallographic calculations 
which are described in Stout and Jensen, supra. 

In addition to a geometric description of a bound 
conformation of a ligand, a pharmacophore model can include 

15 other characteristics of atoms or moieties of the ligand 
including, for example, charge or hydrophobicity . Thus, a 
pharmacophore model can be a generalized structure, which 
includes but does not unambiguously describe the bound 
conformations of the ligand bound to the polypeptides in the 

20 pharmacof amily from which it was derived. For example, 
atoms can be represented as units of charge such that an 
oxygen in a bound conformation of a ligand can be 
represented by an electronegative point in the pharmacophore 
model. In this example, the electronegative point in the 

25 pharmacophore model includes any electronegative atom at 
that particular location including, for example, an oxygen 
or sulfur. 



A pharmacophore model can be constructed to 
include, in addition to characteristics of the ligand 
itself, characteristics of an atom or moiety that interacts 
with the ligand and from a bound polypeptide. 
5 Characteristics of an interacting polypeptide atom or moiety 
that can be included in a pharmacophore model include, for 
example, atomic number, volume occupied, distance from an 
atom of the ligand, charge, hydrophobicity , polarity, or 
location relative to the ligand. Methods for constructing a 
10 pharmacophore model to include interacting atoms from a 
polypeptide are provided in Example III. 

~£ A characteristic included in a pharmacophore model 

can be incorporated into a geometric representation using 
\\ any additional representation that can be correlated with 

^ 15 the characteristic. For example, use of color or shading 

can be used to identify regions having characteristics such 

E as charge, polarity, or hydrophobicity. As such, the depth 

U 

n{ of shading or color or the hue of color can be used to 

{j» determine the degree of a characteristic. By way of 

H 20 example, a common convention used in the art is to identify 

« regions of increased positive charge with deeper shades of 

blue, areas of increased negative charge with deeper shades 
of red and neutral regions with white. Numeric 
representations can also be used in a pharmacophore model 
25 including, for example, values corresponding to potential 
energy for an interaction, or degree of polarity. 



In addition, a pharmacophore model can incorporate 
constraints of a physical or chemical property of the bound 
conformations of a ligand in a pharmacocluster . A 
3 0 constraint of a physical property can be, for example, a 



distance between two atoms, allowed torsion angle of a bond, 
or volume of space occupied by an atom or moiety. A 
constraint of a chemical property can be, for example, 
polarity, van der Waals interaction, hydrogen bond, ionic 
bond, or hydrophobic interaction. Such constraints can be 
included in a pharmacophore model using the representations 
described above. 

A pharmacophore model can include two or more 
pharmacoclusters . In order to identify a ligand having 
broad specificity for two or more polypeptide - 
pharmacof amilies, a pharmacophore model can be derived from 
the two or more corresponding pharmacoclusters. 
Additionally, in order to identify a ligand that can 
preferentially bind a first polypeptide which belongs to a 
first polypeptide pharmacof amily compared to a second 
polypeptide of a second polypeptide pharmacof amily, a 
pharmacophore model can incorporate constraints on geometry 
or any other characteristic so as to exclude a 
characteristic of the bound conformation of the ligand bound 
to the second polypeptide. For example, a geometric 
constraint can be a forbidden region for one or more atom of 
a bound conformation of a ligand. A forbidden region can be 
identified by overlaying two conformer models in a 
coordinate system and identifying a coordinate or set of 
coordinates differentially occupied by one or more atoms of 
the conformer models. A pharmacophore model incorporating a 
forbidden region as such will be specific for a polypeptide 
of one pharmacof amily over a polypeptide of a second 
pharmacof amily correspondent with the constraint 
incorporated. 
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An advantage of the invention is that a 
pharmacophore model can be created based on multiple 
structures of the same ligand. In comparison to a 
pharmacophore model derived from a single structure or 
5 different ligands, a pharmacophore model derived from 

multiple bound conformations of the same ligand can include 
a greater degree of geometric information. For example, 
averaging of multiple bound conformations of the same ligand 
can provide torsion angle constraints that are not available 
10 from a single structure and not evident from comparing 
dif ferent- ligands . 

The invention further provides a method for 
'\! identifying a binding compound for one or more members of a 

15 polypeptide pharmacof amily by identifying a compound having 
a selected conformation-dependent property of a 
pharmacocluster . A binding compound can be any molecule 
having selected conformation-dependent properties of a 
ligand such that the binding compound can form a complex 
Hj 20 with one or more members of one or more polypeptide 

j^t pharmacof amily . A method for identifying a binding compound 

□ for one or more members of a polypeptide pharmacof amily can 

include the steps of contacting a ligand with a polypeptide 
member of a pharmacof amily ; identifying a conf ormation- 
25 dependent property associated with a bound conformation of 
the ligand bound to the polypeptide; comparing the 
conformation-dependent property of the bound conformation of 
the ligand bound to the polypeptide with a conformation- 
dependent property of a bound conformation of a ligand bound 
3 0 to another polypeptide in the same pharmacof amily ; and 
identifying a ligand bound to the polypeptide with a 
conformation-dependent property similar to a bound 
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conformation of a ligand bound to another polypeptide in the 
same pharmacof amily , thereby identifying a compound that 
binds one or more polypeptide members of a pharmacof amily . 
A compound that binds to one or more members of a 
polypeptide pharmacof amily can be identified by determining 
a conformation-dependent property by any of the methods 
described herein. For example, a ligand conformation or 
spectroscopic signal can provide a conformation-dependent 
property useful in identifying a compound that binds to one 
or more members of a polypeptide pharmacof amily . 

The methods described herein for identifying a 
binding compound for one or more members of a polypeptide 
pharmacof amily can readily be adapted to a high throughput 
screening method. For example, methods of rapidly detecting 
a conformation-dependent property in a sequence of samples 
or detecting a conformation-dependent property in parallel 
samples can be applied to a high- throughput screen. One 
skilled in the art will know how to adapt the methods 
described here to a high throughput screening format using, 
for example, robotic manipulation of samples. 

A method for identifying a binding compound for 
one or more members of a polypeptide pharmacof amily can 
include the steps of determining a bound conformation of a 
ligand bound to a polypeptide member of a polypeptide 
pharmacof amily ; comparing the bound conformation of the 
ligand bound to the polypeptide member of the polypeptide 
pharmacof amily to a pharmacophore model; and identifying the 
bound conformation of the ligand bound to the polypeptide 
member of the polypeptide pharmacof amily that satisfies the 
constraints of the pharmacophore model as a binding compound 
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for one or more members of the pharmacof amily in which the 
polypeptide member belongs. 

A pharmacophore model can be useful in querying a 
database of polypeptide structures to find other members of 
5 a polypeptide pharmacof amily . For example, a member of a 
polypeptide pharmacof amily can be identified by querying a 
database of bound conformations of a ligand to retrieve a 
structure that fits the constraints of the query 
pharmacophore model, thereby identifying the retrieved 
10 polypeptide as a member of the pharmacof amily from which the 
pharmacophore model was derived. A pharmacophore model can 
also be used to identify a new member of a polypeptide 
S! pharmacof amily by querying a database of one or more 

Z\ polypeptide structures using an algorithm that docks or 

f* 15 compares the pharmacophore model to polypeptide structures, 

jh wherein a favorable docking or comparison identifies a 

s polypeptide as a member of the same polypeptide 

Li. 

ns pharmacof amily from which the pharmacophore model was 

[U derived. The database queries described above can be 

ni 

p'i 20 performed with algorithms available in the art including, 
O for example, THREEDOM and CATALYST. 

An advantage of the invention is that a 
pharmacophore model can also be used to identify a binding 
compound that is specific for polypeptides of one or more 

25 pharmacof amilies . For example, a pharmacophore model can be 
compared to a structure of a compound or to a bound 
conformation of a ligand to identify those having similar 
properties. A conformer model can be further used to query 
a database of compounds to identify individual compounds 

30 having similar properties. 



A pharmacophore model of the invention can also be 
used to design a binding compound that is specific for 
polypeptides of one or more pharmacof amilies . A 
pharmacophore model identified by these criteria can be used 
as a scaffold or set of constraints for developing a 
compound having enhanced binding affinity or specificity for 
polypeptides of of one or more pharmacof amilies . Using 
similar methods a pharmacophore model can be used to design 
a combinatorial synthesis producing a library of compounds 
having properties consistent or similar to the model which 
can be then be screened for enhanced- binding- affinity or 
specificity for polypeptide members of one or more 
pharmacof amilies . An algorithm can be used to design a 
binding compound based on a pharmacophore model including, 
for example, LUDI as described by Bohm, J. Comput . Aided 
Mol. Pes. 6:61-78 (1992). 

A compound can be identified as satisfying the 
constraints of a pharmacophore model by a variety of methods 
for comparing structures. For example, a pharmacophore 
model that is a geometric representation such as a conformer 
model can be overlaid with a compound, and the best fit 
determined as described herein. Substantial overlap between 
a compound and a pharmacophore model can be indicated by a 
visual comparison and/or computation based comparison based 
on for example, RMSD values or torsion angle values as 
described above. In a case where a pharmacophore model is 
represented by constraints, a compound can be fitted to the 
pharmacophore model to identify if the properties of the 
compound satisfy the constraints of the pharmacophore model. 
For example, if a pharmacophore model contains, as a 
constraint, a maximum distance between atoms, a compound 



68 

that satisfies the constraint can be identified as having a 
bond distance between corresponding atoms that is at least 
the maximum value. One skilled in the art will know how to 
extend such methods of comparison to any physical or 
5 chemical constraint. 

A compound can also be identified as satisfying 
the constraints of a pharmacophore model by demonstrating 
the same characteristics for one or more specific atom 
located within a volume of space defined by the geometric 

10 constraints of the pharmacophore model. For example, in a 
case where polarity is a constraint and where a conformation 
of a compound can be overlaid with a pharmacophore model, an 
atom that overlaps a volume of space indicated by the 
pharmacophore and having polarity within the defined limits 

15 can be identified as satisfying constraints of the 

pharmacophore. By extension, a compound having atoms which 
satisfy all constraints of a pharmacophore is identified as 
a binding compound for one or more members of a polypeptide 
pharmacof amily from which the pharmacophore was produced. 

20 Therefore, the invention provides a binding 

compound identified by the above described methods. For 
example, the invention provides a binding compound 
identified using a pharmacophore model or a conformer model 
derived from a pharmacocluster and/or pharmacof amily . 

25 The invention provides a pharmacophore model, 

selected from the group consisting of pharmacophore model 1 
having coordinates listed in Tables 3B and 3C, pharmacophore 
model 2 having coordinates listed in Tables 4B and 4C, 
pharmacophore model 3 having coordinates listed in Tables 5B 
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and 5C, pharmacophore model 4 having coordinates listed in 
Tables 6B and 6C, pharmacophore model 5 having coordinates 
listed in Tables 7B and 7C, pharmacophore model 6 having 
coordinates listed in Tables 8B and 8C, pharmacophore model 
5 7 having coordinates listed in Tables 9B and 9C, and 

pharmacophore model 8 having coordinates listed in Tables 
10B and IOC. 

The invention also provides a medium comprising a 
storage medium and stored in the medium, atom coordinates 
selected from the atomic coordinates listed in Table 3B, 3C, 
4B, 4C, 5B, 5C, 6B, 6C, 7B, 7C, 8B, 8C # 9B, 9C, 10B or IOC, 
or a subset thereof. In one embodiment the medium comprises 
a computer readable medium. The use of a computer apparatus 
is convenient since atomic coordinates can be conveniently 
stored and accessed for manipulation including, for example, 
docking to a polypeptide structure or comparison to 
coordinates for other bound conformations of a ligand. 
Exemplary methods for manipulating atomic coordinates are 
described above . 

20 It is understood that a computer apparatus of the 

invention need not itself store atomic coordinates of the 
invention. The computer apparatus contains an algorithm for 
viewing a structure from the coordinates or otherwise 
manipulating the coordinates. By using various hardware, 

25 software and network combinations, the atomic coordinates 
can be manipulated in a variety of configurations. Such a 
separate medium can be another computer apparatus, a storage 
medium such as a floppy disk, Zip disk or a server such as a 
file-server, which can be accessed by a carrier wave such as 

30 an electromagnetic carrier wave. One skilled in the art 
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will know or can readily determine appropriate hardware, 
software or network interfaces that allow interconnection of 
an invention computer apparatus. 

The methods of the invention described herein can 
be performed in a computer apparatus using the atomic 
coordinates listed in Table 3B, 3C, 4B, 4C, 5B, 5C, 6B, 6C, 
7B, 7C, 8B, 8C, 9B, 9C, 10B or 10C by adding the step of . 
entering the coordinates or a subset of the coordinates to 
the computer apparatus that performs a method of the 
invention. One skilled in the art will know or can readily 
determine an algorithm instructing a computer apparatus to 
carry out the methods of the invention. 

It is understood that modifications which do not 
substantially affect the activity of the various embodiments 
of this invention are also provided within the definition of 
the invention provided herein. Accordingly, the following 
examples are intended to illustrate but not limit the 
present invention . 

EXAMPLE I 

Identification of Polypeptide Pharmacof amilies Based on 
Bound Conformations of NAD(P) (H) Ligands 

This example describes identification of ligand 
conformer groups and corresponding polypeptide 
pharmacof amilies based on bound conformations of NAD (P) (H) 
bound to polypeptide oxidoreductases . 



The oxidoreductases form a family of polypeptides 
that bind NAD (H) and NADP(H) . In order to identify 
pharmacof amilies within the family of oxidoreductases, bound 
conformations of NAD(P) (H) were determined by searching the 
protein databank. Bound conformations from 156 structures 
were clustered into separate pharmacoclusters, and 
pharmacof amilies were identified according to binding to 
bound conformations of NAD (P) (H) in separate 
pharmacoclusters . 

Structure files containing polypeptides with bound 
NAD (P) (H) were identified from the protein databank by 
keyword searches using the database software. Keywords 
included "NAD," "NADH," "NADP," "NADPH," "oxidoreductase , " 
"dehydrogenase" and "reductase." Cluster analysis was 
performed using the algorithm COMPARE (Chiron Corp, 1995; 
distributed by Quantum Chemistry program Exchange, 
Indianapolis IN) in combination with visual inspection. 
All clusters were visually inspected using Insight 98 for 
outliers that demonstrated poor overlay with the rest of the 
pharmacocluster as a whole. These outliers were compared 
against each other and existing pharmacoclusters to find 
other possible matches. Those that did not fit any family 
were removed. Comparison between bound conformations was 
made based on the RMSD equations supplied in COMPARE. 

Eight pharmacoclusters were identified by this 
method, as shown in Figure 1. Visual inspection of the 
clusters in Figure 1 demonstrates that members within a 
cluster are substantially overlapped. Comparison between 
clusters demonstrates substantial differences. For example, 
the bound conformations in cluster 5 have an extended 
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structure compared to the bound conformations in cluster 4, 
which form a horseshoe like shape. Other differences 
include, for example, a flip in the nicotinamide ring 
between cluster 1 and cluster 2 such that the nicotinamide 
ring is anti to the ribose in cluster 1 and syn to the 
ribose in cluster 2 and a change in torsion angle in the 
bonds connecting the adenine ribose to the adenine phosphate 
for the bound conformations of cluster 3 compared to those 
of cluster 2. 

-- The dihedral angles for various bonds in the bound 

conformations of the NADP(H) ligand can be used to 
distinguish the pharmacoclusters . As shown in Table 1 (see 
Figure 2 for atom and bond locations) , although many 
dihedral angles are similar between two or more 
pharmacoclusters, each pharmacocluster can be distinguished 
from the others by comparison of the full set of dihedral 
angles. For example, pharmacoclusters 2 and 3 can be 
distinguished by comparison between the dihedral angles at 
04 1 A-C4 ■ A-C5 ' A-05 ' A which are 154 degrees and -131 degrees 
respectively and by comparison between the dihedral angles 
at C5 ' A-05 ' A-PA-03 which are 105 degrees and 57 degrees 
respectively. 
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A quantitative analysis of the results of 
clustering bound conformations of NAD (P) (H) is provided in 
Table 2 . Table 2 shows RMSD values calculated from 
comparisons between each pharmacocluster ' s average 
coordinates. Average coordinates were determined from the 
pharmacocluster subsets listed in Tables 3 through 10 as 
described below. 
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Tables 3A, 4A, 5A, 6A, 7A, 8A, 9A and 10A show 
RMSD values for subsets of members of pharmacoclusters 1-8, 
respectively. The RMSD values for each member were 
calculated as comparisons to an average structure for the 
subsets shown in each table respectively. For each 
pharmacocluster a subset of the possible ligands that belong 
to each cluster were identified. Each subset was chosen to 
maximize the diversity of the family and to minimize over- 
representation of ligand conformations from enzymes that 
exist multiply in the PDB database. The goal of the subset 
selection was to fully represent characteristics, from , 
oxidoreductases belonging to a range of species and 
catalyzing a range of different reactions. For example, 
there exists over ten alcohol dehydrogenases in the PDB 
database; however, for purposes of this study, only three 
were chosen from three different species for use in the 3D 
overlay and the pharmacophore construction. Average 
coordinates for the above described pharmacocluster subsets 
were obtained by overlaying ligand structures in MSI 
Insight I I using the overlay function. The three dimensional 
coordinates for each atom in each ligand were used to 
calculate an average position and a standard deviation for 
the pharmacof amily . 

Comparison of the RMSD values in part A of Tables 
3 through 10 with the RMSD values in Table 2 demonstrate 
that a member of a pharmacocluster can be identified as 
having a lower RMSD compared to an average conformation of 
the members in its pharmacocluster than the RMSD between 
each family's average coordinates. In some cases it can be 
beneficial to combine two or more methods of comparison. 
For example, as described above pharmacoclusters 2 and 3 



which have a relatively low RMSD when compared to each other 
can be distinguished from each other by visual inspection 
and by comparison of dihedral angles at various bonds. 

These results demonstrate that bound conformations 
of a ligand can be grouped into pharmacoclusters by methods 
of structure comparison. These results also demonstrate 
methods for distinguishing pharmacoclusters and members 
within pharmacoclusters. 

^ - Example II 

Correlation Between the Structure of Polypeptides and the 
Bound Conformations of NAD (P) (H) 

This example describes a correlation between bound 
conformations of NAD (P) (H) and structural classification of 
polypeptides such that polypeptides of a pharmacof amily have 
similar protein fold. 

Pharmacoclusters for conformations of NAD (P) (H) 
bound to oxidoreductase polypeptides were clustered as 
described in Example I. For each polypeptide the protein 
fold, SCOP super-family designation and SCOP family 
designation was identified from the SCOP website 
administered by Laboratory of Molecular Biology at -the MRC, 
Cambridge England (http://mrc-lmb.cam.ac.uk). 

Table 11 shows the grouping of NAD (P) (H) binding 
polypeptides into 8 pharmacof amilies . 
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The results shown in Table 11 demonstrate that 
bound conformation of NAD (P) (H) can be correlated with 
protein fold. Grouping oxidoreductases into 
pharmacof amilies based on the bound conformations of 
NAD(P) (H) resulted in a correlation with protein fold. 
Pharmacof amilies 1-3 consist of polypeptides having the 
NAD (P) (H) binding Rossman fold. Pharmacof ami ly 4 consists 
of polypeptides having heme- linked catalase fold. 
Pharmacof ami ly 5 consists of polypeptides having the (3-a TIM 
barrel fold. Pharmacof amily 6 consists of polypeptides 
having the dihydrof olate reductase fold. Pharmacof amily 7 
consists of polypeptides having the FAD/NAD (P) (H) binding 
domain fold. Trypanathione reductase was added to family 7 
by homology of its active site to the active sites of other 
members of pharmacof amily 7 independent of bound ligand 
conformation. Pharmacof amily 8 consists of polypeptides 
having the ferrodoxin like fold. Pharmacof amilies 1 and 2 
were identified based on anti or syn conformation, 
respectively, of the nicotinamide ring relative to the 
ribose. Additionally, a change in the torsion angles in the 
bonds connecting the adenine ribose to the adenine phosphate 
separates the family members having a Rossman fold into a 
third pharmacof amily, identified as pharmacof amily 3 . 

The results described in this example demonstrate • 
that a bound conformation of a ligand can be correlated with 
polypeptide fold. Furthermore, the results obtained by the 
method are consistent with results obtained by SCOP. 
Therefore, classification based on bound conformation of 
ligands can be used to classify polypeptides according to 
structure . 



EXAMPLE III 

Determination of a conformer model and pharmacophore for 

pharmacoclus ters 1-8 



This example demonstrates determination of the 
average bound conformations from pharmacoclusters 1-8 and 
construction of conformer models based on the average bound 
conformations. This example also demonstrates construction 
of a pharmacophore model based on the average bound 
conformations and interactions with polypeptides. 

Conformer models for each pharmacocluster were 
produced by determining an average structure for the subset 
of members of each pharmacocluster as described in Example 
I. The coordinates for conformer models of pharmacoclusters 
1-8 are shown in Part C of Tables 3-10 respectively. 

Pharmacophore models were constructed by aligning 
the active sites of a pharmacof amily of oxidoreductases . 
Three-dimensional overlays were achieved using Insight II 
overlay module to overlay the NAD (P) ligands of each enzyme- 
ligand complex. Heteroatoms in the surrounding protein that 
could function as hydrogen bond acceptors or hydrogen bond 
donors were identified in each complex that made 
interactions with the NAD (P) ligand. These heteroatoms that 
had common positions in three dimensional space (within 3A 
of each other in the overlay) in each enzyme complex and 
that made a common interaction with the ligand were then 
grouped together and tabulated for pharmacophore 
construction. Water molecules were similarly identified and 
grouped. The grouped heteroatoms and water molecules are 
listed in Part D of Tables 3-10 below. Finally the average 
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coordinates and the standard deviation for each interaction 
group were calculated. The final pharmacophore model was 
produced by overlaying interaction groups on the conformer 
model (average ligand structure) . 

The coordinates for pharmacophore models of 
pharmacoclusters 1-8 are shown in parts B and C of Tables 3- 
10, respectively. Specifically, each conformer model 
includes the average NAD(P) coordinates (in part C of each 
Table) and the pharmacophore model includes both the average 
NADP coordinates, average water coordinates and the- average 
protein heteroatom coordinates (including coordinates in 
both part B and C of each Table) . An exception is the 
pharmacophore model derived from pharmacof amily 7 which 
includes average water coordinates and average protein 
heteroatom coordinates for all polypeptides listed but has a 
conformer model derived from NAD (P) bound to each 
polypeptide listed except trypanathione reductase. 

A structural representation of each conformer 
model with overlayed interaction groups used to determine 
respective pharmacophore models 1-8 is provided in Figure 3. 
The structures shown in Figure 3 reflect the average NAD(P) 
coordinates shown in Part C of Tables 3-10 and the 
coordinates for all interacting groups used to calculate the 
average water coordinates and the average protein heteroatom 
coordinates as shown in Part D of Tables 3-10. Hydrogen 
bond acceptors are labeled with an 'A' followed by a number 
for each group. These are listed in the pharmacophore 
Tables and designated on the pharmacophore figures. Donors 
are labeled with a X D' ; and water molecules are labeled with 
a *W . 



This example demonstrates construction of 
conformer models based on the bound conformations of ligands 
in pharmacoclusters . This example also demonstrates 
construction of a pharmacophore model based on the bound 
conformations of ligands in pharmacoclusters and their 
interactions with polypeptides in their respective 
pharmacof amilies . 

Example IV 

Correlation Between the Bound Conformation of Ligands and a 
Conformation-Dependent Property 

This example describes a conformation-dependent 
property that is correlated with a bound conformation of a 
ligand . 

A 2D [ X H , X H] NOESY spectrum was recorded with a 
0.2 ml sample of 1 mM NADP and 200 jiM of enzyme 1-deoxy D- 
xylulose 5 -phosphate reductoisomerase (DOXP) . The spectrum 
was" measured with a Bruker DRX700 spectrometer operating at 
700 MHZ X H frequency. The total measuring time was about 12 
h. 

The spectrum is shown in Figure 4 and atoms are 
identified according to Figure 2. The relative intensities 
of the observed transferred NOEs (trNOEs) between the ribose 
proton H-Cl'N(NCl') and the protons on the nicotinamide 
ring, H-C4N and H-C2N shown in Figure 4, reveal that the 
NADP adopts a syn conformation when bound to the enzyme. 



The bound conformations in Pharmacocluster 1 and 2 
can be distinguished according to anti or syn conformation, 
respectively, of the nicotinamide ring relative to the 
ribose . Therefore, these results demonstrate that the 
relative intensities of the observed trNOE's between the 
ribose proton H-Cl'N(NCl') and the protons on the 
nicotinamide ring, H-C4N and H-C2N can provide a 
conformation dependent property useful in distinguishing 
members of pharmacoclusters 1 and 2 . 

Example V - 
Binding compounds having specificity for one or more 
polypeptide pharmacof amilies . 

This example demonstrates querying a database of 
compounds to identify individual compounds having similar 
conformations. This example also demonstrates preferential 
binding of a compound to a polypeptide o # f one pharmacof amily 
over another. 

The TTE0001 . 001.A07 AND TTE0001 . 002 . DO 2 compounds 
were identified by using the THREEDOM algorithm to query a 
database of commercially available molecules (ASINEX; 
Moscow, Russia) by shape matching with cibacron blue. 
Coordinates of cibacron blue were obtained from the 
published 3D structure (Li et al., Proc . Natl. Acad. Sci. 
USA 92:8846-8850 (1995)). The database was created by 
converting an SD format file of structures from ASINEX to 
INTERCHEM format coordinates using the batch2to3 program. 
Cibacron blue was compared against each structure in the 
database in multiple orientations to generate a matching 
score. Out of 37,926 structures searched, the 750 best 
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matching scores were selected. From these 750 structures, 
TTE0001.001.A07 AND TTE0001 . 002 . DO 2 were selected and 
purchased based on objective criteria such as likely 
favorable binding interactions, pharmacophore properties, 
synthetic accessibility and likely pharmacokinetic, 
toxicological, adsorption and metabolic properties. 

Kinetic studies were carried out in 1-cm cuvettes 
in a 1 mL volume at 25°C. Lactate dehydrogenase reactions 
were monitored spectrophotometrically with a Gary 300 by 
following the decrease in absorbance^ at 340 nm due to the ~ 
oxidation of NADH by pyruvate. Lactate dehydrogenase 
reaction mixtures contained 100 mM Hepes buffer at pH 7.4, 
as well as 2.5 mM pyruvate, 10 |^M NADH, 5 ng/mL lactate 
dehydrogenase. NADPH, NADH, Hepes buffer, and rabbit muscle 
lactate dehydrogenase were purchased from Sigma. Cytochrome 
P450 reductase reactions were monitored by following the 
decrease in absorbance at 550 nm due to the reduction of 
ferric cytochrome c by NADPH. Cytochrome P4 50 reductase 
reaction mixtures contained 100 mM Hepes buffer at pH 7.4, 
as well as 80 (xM ferric cytochrome c, 10 |iM NADPH, and 80 
ng/mL cytochrome P450 reductase. Data were fitted using the 
FORTRAN programs of Cleland, Adv. Enzymol. 45: 273-387 
(1977) which perform nonlinear least squares fits to the 
appropriate equations. Substrates were varied around their 
Michaelis constants, while nonvaried substrate was kept at a 
concentration close to its Michaelis constant. The 
concentration of inhibitor that gives 50% inhibition (IC50) 
values were obtained by fitting data to the equation for a 
line, where Y values are l/rate and X values are the 
concentration of inhibitor, as in a Dixon plot (Segel, 
supra). The X-intercept is the IC50. If a full kinetic 
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profile was done, then K is values were obtained by fitting 
the data to the equation for a competitive inhibitor: 

V max A 

rate = 

KJl + I/K is ) + A 

where rate is the rate of reaction in units of 
absorbance/minute, V max is the maximum velocity, 1^ is the 
Michael is constant for A, K is is the inhibition dissociation 
constant for the inhibitor, I is the inhibitor - • • 

concentration, and A is the concentration of NADH or NADPH. 
In all cases, the fit to the above equation was used only 
after establishing that the fit to equations for 
noncompetitive and uncompetitive inhibition were less 
appropriate based on values for sigma (overall fit) as well 
as standard deviations for fitted constants (K is and K Ai ) . 

As shown in Figure 5, compound TTE0001 . 001 . AO 7 
could inhibit binding of NADH to lactate dehydrogenase and 
NADPH to cytochrome P450 reductase which are polypeptide 
members of pharmacof amily 1 and 8 respectively. Compound 
TTEO 0 0 1 . 0 0 1 . AO 7 demonstrated high binding affinity for both 
lactate dehydrogenase and cytochrome P4 50 reductase. 

Analysis of inhibition of binding between NADH and 
lactate dehydrogenase is shown in Figure 6 . Compound 
TTE0001.002.D02 inhibited lactate dehydrogenase with a K is 
of 2.1 yM. Similar measurements of cytochrome P450 
reductase with concentrations of compound TTEO 001 . 002 . DO 2 up 
to 0.5 mM did not indicate inhibition. These results 
indicated that compound TTE0001 . 002 . D02 had a K is of greater 



than 1 mM with cytochrome P450 reductase. Thus, compound 
TTE0001 . 002 . DO 2 demonstrated preferential binding for 
pharmacof amily 1 having an inhibitory dissociation constant 
(K is ) that was at least 500 fold lower than for 
pharmacof amily 8. 

The results described in this example demonstrate 
that a binding compound can be identified by structural 
comparison to a bound conformation of a ligand. 
Furthermore, the results demonstrate that binding compounds 
that interact with polypeptides from multiple 
pharmacof ami lies or compounds that preferentially bind to 
polypeptides of one pharmacof amily compared to polypetides 
of another pharmacof amily can be identified by structural 
comparison to a bound conformation of a ligand. 

Example VI 

Identification of a ligand using a pharmacophore model 

This example demonstrates construction of a 
pharmacophore model, use of the model to identify a binding 
ligand and confirmation of the ability of the identified 
compound to bind a polypeptide member of the pharmacof amily 
from which the pharmacophore model was derived. 

Pharmacophore models were constructed to include 
part or all of the NAD(P) shape, hydrogen bond donors, 
hydrogen bond acceptors and/or other chemical features 
described in Tables 3-10. The combination of chemical 
features chosen for each search pharmacophore in a search 
set were chosen in an attempt to cover a diverse range of 
combinations of possible chemical interactions and to 



represent the protein ligand interactions that occur most 
frequently in the particular pharmacof amily . 

Pharmacophore shape was derived using the program 
CATALYST, and was calculated using the Van der Waals surface 
for part or all of the structure of the averaged NAD(P) 
coordinates determined for a pharmacocluster . Desired 
hydrogen bonding features, water molecules and other 
chemical motifs were positioned in the pharmacophore model 
using the average coordinates determined for both the 
pharmacof amily and pharmacocluster . 

The components of a pharmacophore model derived 
from the coordinates presented in Table 3 for pharmacof amily 
1 are shown in Figure 7 . Figure 7A shows the structure for 
the conformer model having coordinates listed in Table 3C 
with a superimposed volume defining the shape of the ligand 
and indicated by grey spheres. A hydrophobic feature was 
added to the pharmacophore model at the average position of 
the hydrophobic region of the nicotinamide ring as shown in 
Figure 7B. Also shown in Figure 7B is a hydrogen bond 
acceptor positioned at the average coordinates for the 
pyrophosphate using the averaged coordinates for the 
location of hydrogen bond acceptors utilized in all of the 
17 polypeptides of the pharmacof amily . Finally, Figure 7B 
shows a hydrogen bond donor positioned according to a 
position where a hydrogen bond donor of a ligand would be 
expected to have favorable interactions with hydrogen bond 
acceptors observed in 11 of the polypeptides of 
pharmacof amily 1. Thus, the hydrogen bond donor does not 
identify a position of an actual hydrogen bond donor in the 
NAD(P) ligand, but instead a location to where a potential 



ligand's hydrogen bond donor could make favorable 
interactions with the polypeptides of pharmacof amily 1. 
Figure 7C shows the combined features of figures 7A and 7B 
present in a pharmacophore model used to search a database 
of compounds . 

To identify potential ligands that bind to 
polypeptides of pharmacof amily 1, computational searches 
were conducted using CATALYST. Searches were made by 
comparing the shape and combination of chemical features of 
the pharmacophore model, shown in Figure 7, to the. shape and 
features of molecules in the database. 

An example of a compound identified using the 
pharmacophore model shown in figure 7C is TTE0008 . 025 . D08 . 
Using a binding assay similar to that described in Example 
V, compound TTE0008 . 025 . D08 was shown to have inhibitory 
activity against pharmacof amily 1 member, 

dihydrodipicolinate reductase (IC 50 = 2.8 |iM) . 
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Coordinates for the conformer and pharmacophore 
models and data used in their construction is presented in 
Tables 3-10 above. Part A of each Table lists subset of 
structures used in constructing the model including molecule 
5 numbers for cross-referencing between parts A-C, the PDB 
accession number, the name of the polypeptide, and the RMSD 
from the pharmacocluster average. Part B of each Table 
lists the average coordinates for heteroatoms and waters of 
the pharmacophore model and includes the atom name (cross 
10 referenced to part D) , designation of interaction ("ACC," 
acceptor; " DON , " . donor ; and "WAT , " water)/ total number of 
atoms included in the calculation of the average, and X, Y, 

□ 

Z coordinates with respective standard deviations (a) . Part 
]i C of each Table lists the coordinates of the conformer model 

ens 

15 using the atom designations of Figure 2 and X, Y, Z 

coordinates with respective standard deviations (a). Part D 
of each Table lists the coordinates for interacting 
molecules used to determine the pharmacophore model 
including the atom name, residue molecule # (which 
2 0 identifies the residue type and molecule number cross - 
~f referenced to Part A) , residue number from the PDB 

structure, total number of atoms summed for the average 
coordinates, and X, Y, Z coordinates with respective 
standard deviations (a) . The bolded entries in part D 
25 correspond to the average values reported in part B. Atom 
names are identified according to IUPAC recommendations as 
described for example in Markley et al . , Pure and Appl . 
Chem. 70:117-142 (1998). 
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Throughout this application various publications 
30 have been referenced. The disclosures of these publications 
in their entireties are hereby incorporated by reference in 



# 

183 

this application in order to more fully describe the state 
of the art to which this invention pertains. 

Although the invention has been described with 
reference to the disclosed embodiments, those skilled in the 
5 art will readily appreciate that the specific details are 
only illustrative of the invention. It is understood that 
modifications which do not substantially affect the activity 
of the various embodiments of this invention are also 
included within the definition of the invention provided 
10 herein. Therefore, it should be understood f that various 

modifications can be made without departing from the spirit 
of the invention. Accordingly, the invention is limited 
only by the following claims. 



